(57) Abstract

The present invention relates to the mammalian HKNG1 gene, a gene associated with bipolar affective disorder [...]
The invention relates, in particular, to methods for the diagnostic evaluation, genetic testing and prognosis of HKNG1 neuropsychiatric disorders [...] attention deficit disorder, schizoaffective disorder, a bipolar affective disorder or a unipolar affective disorder.

METHODS AND COMPOSITIONS FOR DIAGNOSING AND TREATING
CHROMOSOME-18p RELATED DISORDERS

This is a continuation-in-part of U.S. application no. 
09/236,134, filed on January 22, 1999 which claims priority 
under 35 U.S.C. § 119(e)(1) of provisional application no. 
60/078,044 filed on March 16, 1998, of provisional 
application no. 60/088,312 filed on June 5, 1998, and of 
provisional application no. 60/106,056 filed on October 28,
1998, each of which is incorporated herein by reference in 
its entirety. 

1. INTRODUCTION 
The present invention relates, first, to the HKNG1 gene,
shown herein to be associated with central nervous
system-related disorders, e.g., neuropsychiatric disorders, in
particular, bipolar affective disorder and schizophrenia, and
with myopia-related disorders. The invention includes
recombinant DNA molecules and cloning vectors comprising
sequences of the HKNG1 gene, and host cells and non-human
host organisms engineered to contain such DNA molecules and
cloning vectors. The present invention further relates to
HKNG1 gene products, and to antibodies directed against such
HKNG1 gene products. The present invention also relates to
methods of using the HKNG1 gene and gene product, including
drug screening assays, and diagnostic and therapeutic methods
for the treatment of HKNG1-mediated disorders, including
HKNG1-mediated neuropsychiatric disorders such as bipolar
affective disorder, as well as HKNG1-mediated myopia
disorders such as early-onset autosomal dominant myopia.

2. BACKGROUND OF THE INVENTION
There are only a few psychiatric disorders in which
clinical manifestations of the disorder can be correlated
with demonstrable defects in the structure and/or function of
the nervous system. Well-known examples of such disorders
include Huntington's disease, which can be traced to a
mutation in a single gene and in which neurons in the
striatum degenerate, and Parkinson's disease, in which
dopaminergic neurons in the nigro-striatal pathway
degenerate. The vast majority of psychiatric disorders,
however, presumably involve subtle and/or undetectable
changes, at the cellular and/or molecular levels, in nervous
system structure and function. This lack of detectable
neurological defects distinguishes "neuropsychiatric"
disorders, such as schizophrenia, attention deficit
disorders, schizoaffective disorder, bipolar affective
disorders, or unipolar affective disorder, from neurological
disorders, in which anatomical or biochemical pathologies are
manifest. Hence, identification of the causative defects and
the neuropathologies of neuropsychiatric disorders is needed
in order to enable clinicians to evaluate and prescribe
appropriate courses of treatment to cure or ameliorate the
symptoms of these disorders.
One of the most prevalent and potentially devastating of
neuropsychiatric disorders is bipolar affective disorder
(BAD), also known as bipolar mood disorder (BP) or
manic-depressive illness, which is characterized by episodes of
elevated mood (mania) and depression (Goodwin, et al., 1990,
Manic Depressive Illness, Oxford University Press, New York).
The most severe and clinically distinctive forms of BAD are
BP-I (severe bipolar affective (mood) disorder), which
affects 2-3 million people in the United States, and SAD-M
(schizoaffective disorder manic type). They are
characterized by at least one full episode of mania, with or
without episodes of major depression (defined by lowered
mood, or depression, with associated disturbances in rhythmic
behaviors such as sleeping, eating, and sexual activity).
BP-I often co-segregates in families with more etiologically
heterogeneous syndromes, such as a unipolar affective
disorder such as unipolar major depressive disorder (MDD),
which is a more broadly defined phenotype (Freimer and Reus,
1992, in The Molecular and Genetic Basis of Neurological
Disease, Rosenberg, et al., eds., Butterworths, New York, pp.
951-965; McInnes and Freimer, 1995, Curr. Opin. Genet.
Develop. 5, 376-381). BP-I and SAD-M are severe mood
disorders that are frequently difficult to distinguish from
one another on a cross-sectional basis, follow similar
clinical courses, and segregate together in family studies
(Rosenthal, et al., 1980, Arch. General Psychiat. 37,
804-810; Levinson and Levitt, 1987, Am. J. Psychiat. 144,
415-426; Goodwin, et al., 1990, Manic Depressive Illness, Oxford
University Press, New York). Hence, methods for
distinguishing neuropsychiatric disorders such as these are
needed in order to effectively diagnose and treat afflicted
individuals.

Currently, individuals are typically evaluated for BAD
using the criteria set forth in the most current version of
the American Psychiatric Association's Diagnostic and
Statistical Manual of Mental Disorders (DSM). While many
drugs have been used to treat individuals diagnosed with BAD,
including lithium salts, carbamazepine and valproic acid,
none of the currently available drugs are adequate. For
example, drug treatments are effective in only approximately
60-70% of individuals diagnosed with BP-I. Moreover, it is
currently impossible to predict which drug treatments will be
effective in, for example, particular BP-I affected
individuals. Commonly, upon diagnosis, affected individuals
are prescribed one drug after another until one is found to
be effective. Early prescription of an effective drug
treatment, therefore, is critical for several reasons,
including the avoidance of extremely dangerous manic
episodes, the risk of progressive deterioration if effective
treatments are not found, and the risk of substantial side
effects of current treatments.

The existence of a genetic component for BAD is strongly
supported by segregation analyses and twin studies
(Bertelsen, et al., 1977, Br. J. Psychiat. 130, 330-351;
Freimer and Reus, 1992, in The Molecular and Genetic Basis of
Neurological Disease, Rosenberg, et al., eds., Butterworths,
New York, pp. 951-965; Pauls, et al., 1992, Arch. Gen.
Psychiat. 49, 703-708). Efforts to identify the chromosomal
location of genes that might be involved in BP-I, however,
have yielded disappointing results in that reports of linkage
between BP-I and markers on chromosomes X and 11 could be
neither independently replicated nor confirmed in the re-analyses of
the original pedigrees, indicating that with BAD linkage
studies, even extremely high lod scores at a single locus
can be false positives (Baron, et al., 1987, Nature 326,
289-292; Egeland, et al., 1987, Nature 325, 783-787; Kelsoe, et
al., 1989, Nature 342, 238-243; Baron, et al., 1993, Nature
Genet. 3, 49-55).
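
For readers unfamiliar with the lod scores mentioned above, the following is a minimal illustrative sketch of a two-point lod score, i.e., the base-10 logarithm of the likelihood of the data at a recombination fraction theta relative to the likelihood under no linkage (theta = 0.5). It assumes the simplest textbook case of phase-known, fully informative meioses; the function name and parameters are hypothetical and are not taken from the cited studies, whose pedigree analyses were considerably more involved.

```python
import math


def lod_score(recombinants: int, meioses: int, theta: float) -> float:
    """Two-point lod score for phase-known, fully informative meioses.

    Simplifying assumption: each meiosis can be scored unambiguously as
    recombinant or non-recombinant between the marker and trait locus.
    """
    if not 0.0 < theta <= 0.5:
        raise ValueError("theta must lie in the interval (0, 0.5]")
    if not 0 <= recombinants <= meioses:
        raise ValueError("recombinants must be between 0 and meioses")

    non_recombinants = meioses - recombinants
    # log10 likelihood of the observed meioses at recombination fraction theta
    log_l_theta = (recombinants * math.log10(theta)
                   + non_recombinants * math.log10(1.0 - theta))
    # log10 likelihood under the null hypothesis of no linkage (theta = 0.5)
    log_l_null = meioses * math.log10(0.5)
    return log_l_theta - log_l_null


if __name__ == "__main__":
    # One recombinant in ten meioses, evaluated at theta = 0.1
    print(round(lod_score(1, 10, 0.1), 3))
```

A high score at a single locus reflects only how much better the linked model fits the sampled meioses, which is one reason independent replication is still required.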

Recent investigations have suggested possible
localization of BAD genes on chromosomes 18p and 21q, but in
both cases the proposed candidate region is not well defined
and no unequivocal support exists for either location
(Berrettini, et al., 1994, Proc. Natl. Acad. Sci. USA 91,
5918-5921; Murray, et al., 1994, Science 265, 2049-2054;
Pauls, et al., 1995, Am. J. Hum. Genet. 57, 636-643; Maier,
et al., 1995, Psych. Res. 59, 7-15; Straub, et al., 1994,
Nature Genet. 8, 291-296).

Mapping genes for common diseases believed to be caused
by multiple genes, such as BAD, may be complicated by the
typically imprecise definition of phenotypes, by etiologic
heterogeneity, and by uncertainty about the mode of genetic
transmission of the disease trait. With neuropsychiatric
disorders there is even greater ambiguity in distinguishing
individuals who likely carry an affected genotype from those
who are genetically unaffected. For example, one can define
an affected phenotype for BAD by including one or more of the
broad grouping of diagnostic classifications that constitute
the mood disorders: BP-I, SAD-M, MDD, and bipolar affective
(mood) disorder with hypomania and major depression (BP-II).

Thus, one of the greatest difficulties facing
psychiatric geneticists is uncertainty regarding the validity
of phenotype designations, since clinical diagnoses are based
solely on clinical observation and subjective reports. Also,
with complex traits such as neuropsychiatric disorders, it is
difficult to genetically map the trait-causing genes because:
(1) neuropsychiatric disorder phenotypes do not exhibit
classic Mendelian recessive or dominant inheritance patterns
attributable to a single genetic locus; (2) there may be
incomplete penetrance, i.e., individuals who inherit a
predisposing allele may not manifest disease; (3) a phenocopy
phenomenon may occur, i.e., individuals who do not inherit a
predisposing allele may nevertheless develop disease due to
environmental or random causes; and (4) genetic heterogeneity may
exist, in which case mutations in any one of several genes
may result in identical phenotypes.

Despite these difficulties, however, identification of
the chromosomal location, sequence and function of genes and
gene products responsible for causing neuropsychiatric
disorders such as bipolar affective disorders is of great
importance for genetic counseling, diagnosis and treatment of
individuals in affected families.

3. SUMMARY OF THE INVENTION 
The present invention relates, first, to the discovery,
identification, and characterization of novel nucleic acid
molecules that are associated with central nervous
system-related disorders and processes, e.g., human neuropsychiatric
disorders, such as schizophrenia, attention deficit disorder,
schizoaffective disorder, dysthymic disorder, major
depressive disorder, and bipolar affective disorder (BAD)
including severe bipolar affective (mood) disorder (BP-I) and
bipolar affective (mood) disorder with hypomania and major
depression (BP-II). The invention further relates to the
discovery, identification, and characterization of proteins
encoded by such nucleic acid molecules, or by degenerate,
e.g., allelic or homologous, variants thereof. The invention
further relates to the discovery, identification, and
characterization of novel nucleic acid molecules that are
associated with human myopia or nearsightedness, such as
early-onset, autosomal dominant myopia, as well as to the
discovery, identification, and characterization of proteins
encoded by such nucleic acid molecules or by degenerate
variants thereof.

In particular, the nucleic acid molecules of the present
invention represent, first, nucleic acid sequences
corresponding to the gene referred to herein as HKNG1. As
demonstrated in the Examples presented below in Sections 6, 8
and 14, the HKNG1 gene is associated with human CNS-related
disorders, e.g., neuropsychiatric disorders, in particular
BAD. The HKNG1 gene is associated with other human
neuropsychiatric disorders as well, such as schizophrenia.
As demonstrated in the Example presented below in Section 14,
the HKNG1 gene is also associated with human myopia, such as
early-onset autosomal dominant myopia.

In addition to the positive correlation between
mutations within the HKNG1 gene and individuals exhibiting
symptoms of BAD, described in Sections 6 and 8, the present
invention is further based, in part, on Applicants' discovery
of novel HKNG1 cDNA sequences. Applicants' discovery of such
cDNA sequences has led to the elucidation of the HKNG1
genomic (that is, upstream untranslated, intron/exon, and
downstream untranslated) structure, and to the discovery of
full-length and alternately spliced HKNG1 variants and the
polypeptides encoded by such variants. These discoveries are
described in Sections 7 and 10, below. Applicants' discovery
of such cDNA sequences has also led to the elucidation of
novel mammalian (e.g., guinea pig and bovine) HKNG1
sequences, and to the discovery of novel allelic variants and
polymorphisms of such sequences. These discoveries are
described in Section 10 below.

The invention encompasses nucleic acid molecules which
comprise the following nucleotide sequences: (a) nucleotide
sequences (e.g., SEQ ID NOS: 1, 3, 5, 6, 36, and 37) that
comprise a human HKNG1 gene and/or encode a human HKNG1 gene
product (e.g., SEQ ID NO: 2; SEQ ID NO: 4), as well as
allelic variants, homologs and orthologs thereof, including
nucleotide sequences (e.g., SEQ ID NOS: 38, 40, 42, 44, and
46-48) that encode non-human HKNG1 gene products (e.g., SEQ
ID NOS: 39, 41, 43, 45, and 49); (b) nucleotide sequences
comprising the novel HKNG1 sequences disclosed herein that
encode mutants of the HKNG1 gene product in which sequences
encoding all or a part of one or more of the HKNG1 domains are
deleted or altered, or fragments thereof; (c) nucleotide
sequences that encode fusion proteins comprising a HKNG1 gene
product (e.g., SEQ ID NO: 2; SEQ ID NO: 4), or a portion
thereof fused to a heterologous polypeptide; and (d)
nucleotide sequences within the HKNG1 gene, as well as
chromosome 18p nucleotide sequences flanking the HKNG1 gene,
which can be utilized, e.g., as primers, in the methods of
the invention for identifying and diagnosing individuals at
risk for or exhibiting an HKNG1-mediated disorder, such as
BAD or schizophrenia, or for diagnosing individuals at risk
for or exhibiting a form of myopia such as early-onset
autosomal dominant myopia. The nucleic acid molecules of (a)
through (d), above, can include, but are not limited to,
cDNA, genomic DNA, and RNA sequences.

The invention also encompasses the expression products
of the nucleic acid molecules listed above; i.e., peptides,
proteins, glycoproteins and/or polypeptides that are encoded
by the above HKNG1 nucleic acid molecules.

The compositions of the present invention further
encompass agonists and antagonists of the HKNG1 gene product,
including small molecules (such as small organic molecules),
and macromolecules (including antibodies), as well as
nucleotide sequences that can be used to inhibit HKNG1 gene
expression (e.g., antisense and ribozyme molecules, and gene
or regulatory sequence replacement constructs) or to enhance
HKNG1 gene expression (e.g., expression constructs that place
the HKNG1 gene under the control of a strong promoter
system).

The compositions of the present invention further
include cloning vectors and expression vectors containing the
nucleic acid molecules of the invention, as well as hosts
which have been transformed with such nucleic acid molecules,
including cells genetically engineered to contain the nucleic
acid molecules of the invention, and/or cells genetically
engineered to express the nucleic acid molecules of the
invention. In addition to host cells and cell lines, hosts
also include transgenic non-human animals (or progeny
thereof), particularly non-human mammals, that have been
engineered to express an HKNG1 transgene, or "knock-outs"
that have been engineered to not express HKNG1.

Transgenic non-human animals of the invention include
animals engineered to express an HKNG1 transgene at higher or
lower levels than normal, wild-type animals. The transgenic
animals of the invention also include animals engineered to
express a mutant variant or polymorphism of an HKNG1
transgene which is associated with an HKNG1-mediated disorder,
for example an HKNG1-mediated neuropsychiatric disorder,
such as BAD or schizophrenia, or, alternatively, a myopia
disorder such as early-onset autosomal dominant myopia. The
transgenic animals of the invention further include the
progeny of such genetically engineered animals.

The invention further relates to methods for the
treatment of HKNG1-mediated disorders in a subject, such as
HKNG1-mediated neuropsychiatric disorders and HKNG1-mediated
myopia disorders, wherein such methods comprise administering
a compound which modulates the expression of a HKNG1 gene
and/or the synthesis or activity of a HKNG1 gene product so
that symptoms of the disorder are ameliorated.
The invention further relates to methods for the
treatment of HKNG1-mediated disorders in a subject, such as
HKNG1-mediated neuropsychiatric disorders and HKNG1-mediated
myopia disorders, resulting from HKNG1 gene mutations or
aberrant levels of HKNG1 expression or activity, wherein such
methods comprise supplying the subject with a nucleic acid
molecule encoding an unimpaired HKNG1 gene product such that
an unimpaired HKNG1 gene product is expressed and symptoms of
the disorder are ameliorated.

The invention further relates to methods for the
treatment of HKNG1-mediated disorders in a subject, such as
HKNG1-mediated neuropsychiatric disorders and HKNG1-mediated
myopia disorders, resulting from HKNG1 gene mutations or from
aberrant levels of expression or activity, wherein such
methods comprise supplying the subject with a cell comprising
a nucleic acid molecule that encodes an unimpaired HKNG1 gene
product such that the cell expresses the unimpaired HKNG1
gene product and symptoms of the disorder are ameliorated.

The invention also encompasses pharmaceutical
formulations and methods for treating HKNG1-mediated
disorders, including neuropsychiatric disorders, such as BAD
and schizophrenia, and myopia disorders, such as early-onset
autosomal dominant myopia, involving the HKNG1 gene.

In addition, the present invention is directed to
methods that utilize the HKNG1 nucleic acid sequences,
chromosome 18p nucleotide sequences flanking the human HKNG1
gene and/or HKNG1 gene product sequences for mapping the
chromosome 18p region, and for the diagnostic evaluation,
genetic testing and prognosis of a HKNG1-mediated disorder,
such as a HKNG1-mediated neuropsychiatric disorder or a
HKNG1-mediated myopia disorder. For example, in one
embodiment, the invention relates to methods for diagnosing
HKNG1-mediated disorders, wherein such methods comprise
measuring HKNG1 gene expression in a patient sample, or
detecting a HKNG1 polymorphism or mutation in the genome of a
mammal, including a human, suspected of exhibiting such a
disorder. In one embodiment, nucleic acid molecules encoding
HKNG1 can be used as diagnostic hybridization probes or as
primers for diagnostic PCR analysis for the identification of
HKNG1 gene mutations, allelic variations and regulatory
defects in the HKNG1 gene which correlate with
neuropsychiatric disorders such as BAD or schizophrenia.

The invention still further relates to methods for
identifying compounds which modulate the expression of the
HKNG1 gene and/or the synthesis or activity of the HKNG1 gene
products, including therapeutic compounds, which reduce or
eliminate the symptoms of HKNG1-mediated disorders, including
HKNG1-mediated neuropsychiatric disorders such as BAD and
schizophrenia. In particular, cellular and non-cellular
assays are described that can be used to identify compounds
that interact with the HKNG1 gene product, e.g., modulate the
activity of HKNG1 and/or bind to the HKNG1 gene product.
Such cell-based assays of the invention utilize cells, cell
lines, or engineered cells or cell lines that express the
HKNG1 gene product.

In one embodiment, such methods comprise contacting a
compound to a cell that expresses a HKNG1 gene, measuring the
level of HKNG1 gene expression, gene product expression or
gene product activity, and comparing this level to the level
of HKNG1 gene expression, gene product expression or gene
product activity produced by the cell in the absence of the
compound, such that if the level obtained in the presence of
the compound differs from that obtained in its absence, a
compound that modulates the expression of the HKNG1 gene
and/or the synthesis or activity of the HKNG1 gene products
has been identified.
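
The comparison step in the embodiment above can be sketched in code. The following is a minimal illustration only: the text requires merely that the level measured in the presence of the compound "differs" from the level in its absence, so the fold-change cutoff used here (`min_fold_change`, defaulting to 1.5) is an arbitrary assumption, and the function name and inputs are hypothetical rather than part of the disclosed method.

```python
from statistics import mean
from typing import Sequence


def modulates_expression(treated: Sequence[float],
                         untreated: Sequence[float],
                         min_fold_change: float = 1.5) -> bool:
    """Flag a compound whose treated/untreated mean level ratio departs
    from 1 by at least min_fold_change, in either direction.

    Assumptions (not from the source): levels are positive measurements
    on a common scale, and a fixed fold-change cutoff defines "differs".
    """
    if not treated or not untreated:
        raise ValueError("both measurement sets must be non-empty")
    if min_fold_change <= 1.0:
        raise ValueError("min_fold_change must be greater than 1")

    treated_mean = mean(treated)
    untreated_mean = mean(untreated)
    if treated_mean <= 0 or untreated_mean <= 0:
        raise ValueError("expression levels must be positive")

    fold = treated_mean / untreated_mean
    # Up-regulation or down-regulation both count as modulation.
    return fold >= min_fold_change or fold <= 1.0 / min_fold_change


if __name__ == "__main__":
    print(modulates_expression([2.0, 2.2], [1.0, 1.0]))
    print(modulates_expression([1.0, 1.1], [1.0, 1.0]))
```

In practice a statistical test on replicate measurements would likely replace the fixed cutoff; the sketch only makes the presence-versus-absence comparison explicit.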

In another embodiment, such methods comprise contacting
a compound to a cell that expresses a HKNG1 gene and also
comprises a reporter construct whose transcription is
dependent, at least in part, on HKNG1 expression or activity.
In such an embodiment, the level of reporter transcription is
measured and compared to the level of reporter transcription
in the cell in the absence of the compound. If the level of
reporter transcription obtained in the presence of the
compound differs from that obtained in its absence, a
compound that modulates expression of HKNG1 or genes involved
in HKNG1-related pathways or signal transduction has been
identified.

In yet another embodiment, such methods comprise
administering a compound to a host, such as a transgenic
animal, that expresses an HKNG1 transgene or a mutant HKNG1
transgene associated with an HKNG1-mediated disorder such as
a neuropsychiatric disorder (e.g., BAD or schizophrenia), or
to an animal, e.g., a knock-out animal, that does not express
HKNG1, and measuring the level of HKNG1 gene expression, gene
product expression, gene product activity, or symptoms of an
HKNG1-mediated disorder such as an HKNG1-mediated
neuropsychiatric disorder (e.g., BAD or schizophrenia). The
measured level is compared to the level obtained in a host
that is not exposed to the compound, such that if the level
obtained when the host is exposed to the compound differs
from that obtained in a host not exposed to the compound, a
compound that modulates the expression of the mammalian HKNG1 gene
and/or the synthesis or activity of the mammalian HKNG1 gene
products, and/or the symptoms of an HKNG1-mediated disorder
such as a neuropsychiatric disorder (e.g., BAD or
schizophrenia), has been identified.

The present invention still further relates to
pharmacogenomic and pharmacogenetic methods for selecting an
effective drug to administer to an individual having a
HKNG1-mediated disorder. Such methods are based on the detection
of genetic polymorphisms in the HKNG1 gene or variations in
HKNG1 gene expression due to, e.g., altered methylation,
differential splicing, or post-translational modification of
the HKNG1 gene product which can affect the safety and
efficacy of a therapeutic agent.

As briefly discussed above, the present invention is
based, in part, on the genetic and physical mapping of the
HKNG1 gene to a specific portion of the short arm of human
chromosome 18 that is associated with human neuropsychiatric
disorders, in particular, bipolar affective disorder. These
results are described in the Example presented, below, in
Section 6. The invention is also based on the elucidation of
the HKNG1 nucleotide sequence, amino acid sequence and
expression pattern, as described in the Example presented,
below, in Section 7. The invention is further based on the
identification of specific mutations and/or polymorphisms
within the HKNG1 gene which positively correlate with
neuropsychiatric disorders, in particular, BAD, as described
in the Example presented below in Section 8. These mutations
include a point mutation discovered in an individual affected
by BAD which is absent from the corresponding wild-type
nucleic acid derived from non-affected individuals and
linkage disequilibrium of three markers showing cosegregation
with a population of individuals with BAD. This mutation is
a single base mutation which results in a mutant HKNG1 gene
product comprising substitution of a lysine residue for the
wild-type glutamic acid residue at HKNG1 amino acid position
202 of the polypeptide of SEQ ID NO: 2 or the HKNG1 amino acid
residue 184 of the polypeptide of SEQ ID NO: 4. These
mutations further include the mutations discovered in
schizophrenic and BAD patients that are detailed in FIGS.
5A-5B.
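
As an illustration of how a single base change can produce the glutamic acid to lysine substitution described above, the following sketch enumerates the one-base codon changes that convert a Glu codon into a Lys codon under the standard genetic code. The codon assignments are standard; the specific codon affected in HKNG1 is not stated in this passage, so the code does not assume it. The sketch also checks that the two reported positions (202 and 184) differ by the same amount as the polypeptide lengths given elsewhere in this description (495 and 477 residues). All names in the code are hypothetical.

```python
# Standard genetic code entries for the two residues of interest
GLU_CODONS = {"GAA", "GAG"}  # glutamic acid (E)
LYS_CODONS = {"AAA", "AAG"}  # lysine (K)

# Values taken from the text; which codon is mutated is NOT specified there
FULL_LENGTH_RESIDUES = 495   # polypeptide of SEQ ID NO: 2
VARIANT_RESIDUES = 477       # polypeptide of SEQ ID NO: 4
POSITION_IN_FULL_LENGTH = 202
POSITION_IN_VARIANT = 184


def single_base_glu_to_lys():
    """Return (glu_codon, lys_codon, codon_position) for every Glu->Lys
    change that needs exactly one base substitution (position is 1-based)."""
    changes = []
    for glu in sorted(GLU_CODONS):
        for lys in sorted(LYS_CODONS):
            diffs = [i for i in range(3) if glu[i] != lys[i]]
            if len(diffs) == 1:
                changes.append((glu, lys, diffs[0] + 1))
    return changes


if __name__ == "__main__":
    for glu, lys, pos in single_base_glu_to_lys():
        print(f"{glu} -> {lys} (base {pos} of the codon)")
    # Offset between the two numbering schemes
    print(POSITION_IN_FULL_LENGTH - POSITION_IN_VARIANT,
          FULL_LENGTH_RESIDUES - VARIANT_RESIDUES)
```

Both single-base routes are G-to-A changes at the first codon position. The matching offset of 18 is consistent with the variant polypeptide lacking 18 residues upstream of the substituted position, although the arithmetic alone does not establish where those residues are missing.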

3.1. DEFINITIONS

As used herein, the following terms shall have the
abbreviations indicated.

BAC, bacterial artificial chromosomes
BAD, bipolar affective disorder(s)
BP, bipolar mood disorder
BP-I, severe bipolar affective (mood) disorder
BP-II, bipolar affective (mood) disorder with
hypomania and major depression
bp, base pair(s)
EST, expressed sequence tag
HKNG1, Hong Kong new gene 1
lod, logarithm of odds
MDD, unipolar major depressive disorder
ROS, reactive oxygen species
RT-PCR, reverse transcriptase PCR
SSCP, single-stranded conformational polymorphism
SAD-M, schizoaffective disorder manic type
STS, sequence tagged site
YAC, yeast artificial chromosome

"HKNG1-mediated disorders" include disorders involving
an aberrant level of HKNG1 gene expression, gene product
synthesis and/or gene product activity relative to levels
found in clinically normal individuals, and/or relative to
levels found in a population whose level represents a
baseline, average HKNG1 level. While not wishing to be bound
by any particular mechanism, it is to be understood that
disorder symptoms can, for example, be caused, either
directly or indirectly, by such aberrant levels.
Alternatively, it is to be understood that such aberrant
levels can, either directly or indirectly, ameliorate
disorder symptoms (e.g., as in instances wherein aberrant
levels of HKNG1 suppress the disorder symptoms caused by
mutations within a second gene).

HKNG1-mediated disorders include, for example, central
nervous system (CNS) disorders. CNS disorders include, but
are not limited to, cognitive and neurodegenerative disorders
such as Alzheimer's disease, senile dementia, Huntington's
disease, amyotrophic lateral sclerosis, and Parkinson's
disease, as well as Gilles de la Tourette's syndrome,
autonomic function disorders such as hypertension and sleep
disorders, and neuropsychiatric disorders that include, but
are not limited to, schizophrenia, schizoaffective disorder,
attention deficit disorder, dysthymic disorder, major
depressive disorder, mania, obsessive-compulsive disorder,
psychoactive substance use disorders, anxiety, panic
disorder, as well as bipolar affective disorder, e.g., severe
bipolar affective (mood) disorder (BP-I) and bipolar affective
(mood) disorder with hypomania and major depression (BP-II).
Further CNS-related disorders include, for example, those
listed in the American Psychiatric Association's Diagnostic
and Statistical Manual of Mental Disorders (DSM), the most
current version of which is incorporated herein by reference
in its entirety.

"HKNG1-mediated processes" include processes dependent
on and/or responsive, either directly or indirectly, to levels
of HKNG1 gene expression, gene product synthesis and/or gene
product activity. Such processes can include, but are not
limited to, developmental, cognitive and autonomic neural and
neurological processes, such as, for example, pain, appetite,
long term memory and short term memory.

4. BRIEF DESCRIPTION OF THE FIGURES

FIG. 1A-1B. The nucleotide sequence of human HKNG1
cDNA (SEQ ID NO: 1) is depicted on the bottom line. The top
line depicts the full length amino acid sequence of human
HKNG1 polypeptide (SEQ ID NO: 2) encoded by the human HKNG1
cDNA sequence. The nucleotide sequence encoding SEQ ID NO: 2
corresponds to SEQ ID NO: 5.

FIG. 2A-2B. Nucleotide sequence of an alternately
spliced human HKNG1 variant, referred to as HKNG1-V1 (SEQ ID
NO: 3), is depicted on the bottom line. The derived amino
acid sequence of the human HKNG1 gene product (SEQ ID NO: 4)
encoded by this alternately spliced cDNA variant is depicted
on the top line. The nucleotide sequence encoding SEQ ID
NO: 4 corresponds to SEQ ID NO: 6.

FIG. 3A-3R. Genomic sequence of the human HKNG1 gene
(SEQ ID NO: 7). Exons are in bold and the 3' and 5' UTRs
(untranslated regions) are underlined.

FIG. 4. Summary of in situ hybridization analysis of
HKNG1 mRNA distribution in normal human brain tissue.

FIGS. 5A-B. HKNG1 polymorphisms relative to the HKNG1
wild-type sequence. These polymorphisms were isolated from a
collection of schizophrenic patients of mixed ethnicity from
the United States (FIG. 5A) and from the San Francisco BAD
collection (FIG. 5B).

FIGS. 6A-B. Nucleotide sequence of the RT-PCR products
for HKNG1-V2 (FIG. 6A; SEQ ID NO: 36) and HKNG1-V3 (FIG. 6B;
SEQ ID NO: 37).

FIG. 7. The cDNA sequence (SEQ ID NO: 38) and the
predicted amino acid sequence (SEQ ID NO: 39) of the guinea
pig HKNG1 ortholog gphkng1815.

FIG. 8. The cDNA sequence (SEQ ID NO: 40) and the
predicted amino acid sequence (SEQ ID NO: 41) of gphkng7b, an
allelic variant of the guinea pig HKNG1 ortholog gphkng1815.

FIG. 9. The cDNA sequence (SEQ ID NO: 42) and the
predicted amino acid sequence (SEQ ID NO: 43) of gphkng7c, an
allelic variant of the guinea pig HKNG1 ortholog gphkng1815.

FIG. 10. The cDNA sequence (SEQ ID NO: 44) and the
predicted amino acid sequence (SEQ ID NO: 45) of gphkng7d, an
allelic variant of the guinea pig HKNG1 ortholog gphkng1815.

FIG. 11. The cDNA sequence (SEQ ID NO: 46) and the
predicted amino acid sequence (SEQ ID NO: 49) of the allelic
variant bhkng1 of the bovine HKNG1 ortholog.

FIG. 12. The cDNA sequence (SEQ ID NO: 47) and the
predicted amino acid sequence (SEQ ID NO: 49) of the allelic
variant bhkng2 of the bovine HKNG1 homologue.

FIG. 13. The cDNA sequence (SEQ ID NO: 48) and the
predicted amino acid sequence (SEQ ID NO: 49) of the allelic
variant bhkng3 of the bovine HKNG1 homologue.

FIG. 14A-B. Alignments of the guinea pig HKNG1 cDNA
(FIG. 14A) and predicted amino acid (FIG. 14B) sequences for
gphkng1815, gphkng7b, gphkng7c, and gphkng7d.

FIG. 15. Alignments of the cDNA sequences of the bovine
HKNG1 allelic variants bhkng1, bhkng2, and bhkng3.

FIG. 16. Alignments of the human (hkng_aa), bovine
(bhkng1_aa) and guinea pig (gphkng1815_aa) HKNG1 amino acid
sequences.

FIG. 17. Alignments of the human HKNG1 protein
sequences; top line: the mature secreted HKNG1 protein
sequence (SEQ ID NO: 51); second line: immature HKNG1 protein
form 1 (IPF1; SEQ ID NO: 2); third line: immature HKNG1
protein form 2 (IPF2; SEQ ID NO: 64); bottom line: immature
HKNG1 protein form 3 (IPF3; SEQ ID NO: 4).

FIG. 18. The nucleotide sequence of human HKNG1 splice
variant HKNG1A7 cDNA (SEQ ID NO: 65) is depicted on the
bottom line. The top line depicts the full length amino acid
sequence of human HKNG1A7 polypeptide (SEQ ID NO: 66) encoded
by the human HKNG1A7 cDNA sequence.

5. DETAILED DESCRIPTION OF THE INVENTION 

5.1. THE HKNG1 GENE 

HKNG1 nucleic acid molecules are described in this
section. Unless otherwise stated, the term "HKNG1 nucleic
acid" refers collectively to the sequences described herein.

A human HKNG1 cDNA sequence (SEQ ID NO: 1) encoding the
full length amino acid sequence (SEQ ID NO: 2) of the HKNG1
polypeptide is shown in FIG. 1A-1B. The human HKNG1 gene
encodes a secreted polypeptide of 495 amino acid residues, as
shown in FIG. 1A-1B, and SEQ ID NO: 2. The nucleotide
sequence of the portion of the cDNA corresponding to the
coding sequence for HKNG1 (SEQ ID NO: 2) is depicted as SEQ ID
NO: 5.

The HKNG1 sequences of the invention also include splice
variants of the HKNG1 sequences described herein. For
example, an alternately spliced human HKNG1 cDNA sequence,
referred to as HKNG1-V1 (SEQ ID NO: 3) encoding a human HKNG1
variant gene product (i.e., the HKNG1-V1 gene product) is
shown in FIG. 2A-2B. This splice variant of a human HKNG1
gene encodes a secreted polypeptide of 477 amino acid
residues, as shown in FIG. 2A-2B, and SEQ ID NO: 4. The
nucleotide sequence of the portion of the cDNA corresponding
to the coding sequence for HKNG1 (SEQ ID NO: 4) is depicted in
SEQ ID NO: 6.

Another alternately spliced human HKNG1 cDNA sequence
(SEQ ID NO: 65), referred to as HKNG1A7, encodes a second
HKNG1 variant gene product (the HKNG1A7 gene product) which
is depicted in FIG. 18. This splice variant of the human
HKNG1 gene encodes the variant polypeptide shown in FIG. 18
(SEQ ID NO: 66).

The genomic structure of the human HKNG1 gene has been
elucidated and is depicted in FIG. 3A-3R, with the HKNG1
exons indicated in bold type, and the 5'- and 3'-untranslated
regions indicated by underlining. The wild-type genomic
sequence of the HKNG1 gene is depicted in FIG. 3A-3R and SEQ
ID NO: 7.

Non-human homologues or orthologs, e.g., mammalian
orthologs, of the human HKNG1 sequences discussed above are
also provided. Specifically, a guinea pig cDNA sequence (SEQ
ID NO: 38), referred to herein as gphkng1815, encoding the
full length amino acid sequence (SEQ ID NO: 39) of a guinea
pig HKNG1 ortholog is shown in FIG. 7. This guinea pig cDNA
sequence encodes a gene product of 466 amino acid residues,
as shown in FIG. 7 and in SEQ ID NO: 39.

Allelic variants of this guinea pig HKNG1 ortholog,
referred to as gphkng7b, gphkng7c, and gphkng7d (SEQ ID
NOS: 40, 42, and 44, respectively), are shown in FIGS. 8-10,
respectively. The allelic variants gphkng7b, gphkng7c, and
gphkng7d each encode variants of the guinea pig gphkng1815
HKNG1 gene product which contain deletions of 16, 92, and 93
amino acids, respectively, as shown in FIGS. 8-10, in SEQ ID
NOS: 41, 43, and 45, respectively, and in the sequence
alignment in FIG. 14B.

Bovine HKNG1 ortholog cDNA sequences (SEQ ID NOS: 46-48),
referred to herein as bhkng1, bhkng2, and bhkng3, and
each encoding the same bovine ortholog gene product are shown
in FIGS. 11-13, respectively. The bovine HKNG1 allelic
variants encode the same gene product, i.e., a 465 amino acid
protein, as shown in FIGS. 11-13 and in SEQ ID NO: 49.

The HKNG1 gene nucleic acid molecules of the invention
include: (a) nucleotide sequences and fragments thereof






WO 99/47535 



PCT/US99/05606 



(e.g., SEQ ID NOS: 1, 3, 5, 6, 7, 36, 37, and 65) that encode
a HKNG1 gene product (e.g., SEQ ID NOS: 2, 4 and 66), as well
as homologues, orthologs and allelic variants of such
sequences and fragments thereof (e.g., SEQ ID NOS: 38, 40, 42,
44, and 46-48) which encode homologue or ortholog HKNG1 gene
products (e.g., SEQ ID NOS: 39, 41, 43, 45, and 49); (b)
nucleotide sequences that encode one or more functional
domains of a HKNG1 gene product including, but not limited
to, nucleic acid sequences that encode a signal sequence
domain, or one or more clusterin domains as described in
Section 5.2 below; (c) nucleotide sequences that comprise
HKNG1 gene sequences of upstream untranslated regions,
intronic regions, and/or downstream untranslated regions or
fragments thereof of the HKNG1 nucleotide sequences in (a)
above; (d) nucleotide sequences comprising the novel HKNG1
sequences disclosed herein that encode mutants of the HKNG1
gene product in which all or a part of one or more of the
domains is deleted or altered, as well as fragments thereof;
(e) nucleotide sequences that encode fusion proteins
comprising a HKNG1 gene product (e.g., SEQ ID NO: 2, 4, 39,
41, 43, 45, 49 and 65), or a portion thereof fused to a
heterologous polypeptide; and (f) nucleotide sequences (e.g.,
primers) within the HKNG1 gene, and chromosome 18p nucleotide
sequences flanking the HKNG1 gene which can be utilized as
part of the methods of the invention for identifying and
diagnosing individuals at risk for or exhibiting an
HKNG1-mediated disorder, such as BAD, or myopia.

The HKNG1 nucleotide sequences of the invention further
include nucleotide sequences corresponding to the nucleotide
sequences of (a)-(f) above wherein one or more of the exons,
or fragments thereof, have been deleted. In one preferred
embodiment, the HKNG1 nucleotide sequence of the invention is




a sequence wherein the exon corresponding to exon 7 of SEQ ID 
NO: 7, or a fragment thereof, has been deleted. 

The HKNG1 nucleotide sequences of the invention also
include nucleotide sequences that have at least 65%, 70%,
75%, 80%, 85%, 90%, 95%, 98%, or more nucleotide sequence
identity to the HKNG1 nucleotide sequences of (a)-(f) above.
The HKNG1 nucleotide sequences of the invention further
include nucleotide sequences that encode polypeptides having
at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, or higher
amino acid sequence identity to the polypeptides encoded by
the HKNG1 nucleotide sequences of (a)-(f), e.g., SEQ ID NOS:
2, 4, 39, 41, 43, 45, 49, and 66 above.

To determine the percent identity of two amino acid
sequences or of two nucleic acids, the sequences are aligned
for optimal comparison purposes (e.g., gaps can be introduced
in the sequence of a first amino acid or nucleic acid
sequence for optimal alignment with a second amino or nucleic
acid sequence). The amino acid residues or nucleotides at
corresponding amino acid positions or nucleotide positions
are then compared. When a position in the first sequence is
occupied by the same amino acid residue or nucleotide as the
corresponding position in the second sequence, then the
molecules are identical at that position. The percent
identity between the two sequences is a function of the
number of identical positions shared by the sequences (i.e.,
% identity = # of identical overlapping positions/total # of
positions x 100%). In one embodiment, the two sequences are
the same length.
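
The percent identity formula above can be illustrated with a short sketch. It assumes the two sequences have already been aligned to the same length, with gaps written as "-"; the function name and the gap symbol are illustrative choices, not part of the specification.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity of two pre-aligned sequences of equal length.

    Follows: % identity = identical positions / total positions x 100.
    A position counts as identical only when both sequences carry the
    same non-gap residue at that position.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("sequences must not be empty")
    identical = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != gap
    )
    return 100.0 * identical / len(aligned_a)
```

For instance, two four-residue sequences that differ at a single position would score 75% under this definition.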

The determination of percent identity between two
sequences can also be accomplished using a mathematical
algorithm. A preferred, non-limiting example of a
mathematical algorithm utilized for the comparison of two
sequences is the algorithm of Karlin and Altschul (1990)
Proc. Natl. Acad. Sci. USA 87:2264-2268, modified as in
Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA
90:5873-5877. Such an algorithm is incorporated into the
NBLAST and XBLAST programs of Altschul, et al. (1990) J. Mol.
Biol. 215:403-410. BLAST nucleotide searches can be
performed with the NBLAST program, score = 100, wordlength =
12 to obtain nucleotide sequences homologous to the nucleic
acid molecules of the invention. BLAST protein searches can
be performed with the XBLAST program, score = 50, wordlength
= 3 to obtain amino acid sequences homologous to the protein
molecules of the invention. To obtain gapped alignments for
comparison purposes, Gapped BLAST can be utilized as
described in Altschul et al. (1997) Nucleic Acids
Res. 25:3389-3402. Alternatively, PSI-Blast can be used to
perform an iterated search which detects distant
relationships between molecules (Id.). When utilizing BLAST,
Gapped BLAST, and PSI-Blast programs, the default parameters
of the respective programs (e.g., XBLAST and NBLAST) can be
used (see http://www.ncbi.nlm.nih.gov). Another preferred,
non-limiting example of a mathematical algorithm utilized for
the comparison of sequences is the algorithm of Myers and
Miller, (1988) CABIOS 4:11-17. Such an algorithm is
incorporated into the ALIGN program (version 2.0) which is
part of the GCG sequence alignment software package. When
utilizing the ALIGN program for comparing amino acid
sequences, a PAM120 weight residue table, a gap length
penalty of 12, and a gap penalty of 4 can be used.

The percent identity between two sequences can be
determined using techniques similar to those described above,
with or without allowing gaps. In calculating percent
identity, typically only exact matches are counted.

The HKNG1 nucleotide sequences of the invention further
include: (a) any nucleotide sequence that hybridizes to a
HKNG1 nucleic acid molecule of the invention under stringent
conditions, e.g., hybridization to filter-bound DNA in 6x
sodium chloride/sodium citrate (SSC) at about 45°C followed
by one or more washes in 0.2xSSC/0.1% SDS at about 50-65°C,
or (b) under highly stringent conditions, e.g., hybridization
to filter-bound nucleic acid in 6xSSC at about 45°C followed
by one or more washes in 0.1xSSC/0.2% SDS at about 68°C, or
under other hybridization conditions which are apparent to
those of skill in the art (see, for example, Ausubel F.M. et
al., eds., 1989, Current Protocols in Molecular Biology, Vol.
I, Green Publishing Associates, Inc., and John Wiley & Sons,
Inc., New York, at pp. 6.3.1-6.3.6 and 2.10.3). Preferably
the HKNG1 nucleic acid molecule that hybridizes to the
nucleotide sequence of (a) and (b), above, is one that
comprises the complement of a nucleic acid molecule that
encodes a HKNG1 gene product. In a preferred embodiment,
nucleic acid molecules comprising the nucleotide sequences of
(a) and (b), above, encode gene products, e.g., gene products
functionally equivalent to an HKNG1 gene product.

Functionally equivalent HKNG1 gene products include
naturally occurring HKNG1 gene products present in the same
or different species. In one embodiment, HKNG1 gene
sequences in non-human species map to chromosome regions
syntenic to the human 18p chromosome location within which
human HKNG1 lies. Functionally equivalent HKNG1 gene
products also include gene products that retain at least one
of the biological activities of the HKNG1 gene products,
and/or which are recognized by and bind to antibodies
(polyclonal or monoclonal) directed against the HKNG1 gene
products.

Among the nucleic acid molecules of the invention are
deoxyoligonucleotides ("oligos") which hybridize under highly
stringent or stringent conditions to the HKNG1 nucleic acid
molecules described above. In general, for probes between 14
and 70 nucleotides in length the melting temperature (Tm) is
calculated using the formula:

Tm(°C) = 81.5 + 16.6(log[monovalent cations (molar)])
         + 0.41(% G+C) - (500/N)

where N is the length of the probe. If the
hybridization is carried out in a solution containing
formamide, the melting temperature is calculated using the
equation

Tm(°C) = 81.5 + 16.6(log[monovalent cations (molar)])
         + 0.41(% G+C) - 0.61(% formamide) - (500/N)

where N is the length of the probe. In general, hybridization
is carried out at about 20-25 degrees below Tm (for DNA-DNA
hybrids) or 10-15 degrees below Tm (for RNA-DNA hybrids).
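
As a worked illustration of the melting temperature formulas above, the following sketch evaluates Tm for a probe and the corresponding hybridization temperature window. It assumes "log" denotes the base-10 logarithm, which is the usual convention for this formula; the function names are hypothetical.

```python
import math


def melting_temperature(cation_molar: float, gc_percent: float,
                        probe_length: int,
                        formamide_percent: float = 0.0) -> float:
    """Tm (degrees C) per the formula in the text.

    Tm = 81.5 + 16.6*log10([monovalent cations, M]) + 0.41*(% G+C)
         - 0.61*(% formamide) - 500/N
    Assumption: 'log' is taken as log base 10.
    """
    if cation_molar <= 0:
        raise ValueError("cation concentration must be positive")
    if probe_length <= 0:
        raise ValueError("probe length must be positive")
    return (81.5
            + 16.6 * math.log10(cation_molar)
            + 0.41 * gc_percent
            - 0.61 * formamide_percent
            - 500.0 / probe_length)


def hybridization_window(tm: float, rna_dna: bool = False):
    """(low, high) hybridization temperature: 20-25 degrees below Tm
    for DNA-DNA hybrids, 10-15 degrees below Tm for RNA-DNA hybrids."""
    low_offset, high_offset = (15.0, 10.0) if rna_dna else (25.0, 20.0)
    return (tm - low_offset, tm - high_offset)
```

With 1 M monovalent cations, 50% G+C, and a 50-nucleotide probe, the terms are 81.5 + 0 + 20.5 - 10, so Tm comes to about 92°C without formamide.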

Exemplary highly stringent conditions may refer, e.g.,
to washing in 6xSSC/0.05% sodium pyrophosphate at 37°C (for
about 14-base oligos), 48°C (for about 17-base oligos), 55°C
(for about 20-base oligos), and 60°C (for about 23-base
oligos).

These nucleic acid molecules may encode or act as
antisense molecules, useful, for example, in HKNG1 gene
regulation, and/or as antisense primers in amplification
reactions of HKNG1 gene nucleic acid sequences. Further,
such sequences may be used as part of ribozyme and/or triple
helix sequences, also useful for HKNG1 gene regulation.
Still further, such molecules may be used as components of
diagnostic methods whereby, for example, the presence of a
particular HKNG1 allele involved in a HKNG1-related disorder,
e.g., a neuropsychiatric disorder, such as BAD, may be
detected.

Fragments of the HKNG1 nucleic acid molecules can be at
least 10 nucleotides in length. In alternative embodiments,
the fragments can be about 20, 30, 40, 50, 60, 70, 80, 90,
100, 200, 300, 400, 500, 1000, 1500, 2000, 2500, 3000, 3500,
4000, 4500, 5000, or more contiguous nucleotides in length.
Alternatively, the fragments can comprise sequences that
encode at least 10, 20, 30, 40, 50, 100, 150, 200, 250, 300,
350, 400, 450 or more contiguous amino acid residues of the
HKNG1 gene products. Fragments of the HKNG1 nucleic acid
molecules can also refer to HKNG1 exons or introns, and,
further, can refer to portions of HKNG1 coding regions that
encode domains (e.g., clusterin domains) of HKNG1 gene
products.

The HKNG1 nucleotide sequences of the invention can be
readily obtained, for example, by standard sequencing and the
sequence provided herein.

As will be appreciated by those skilled in the art, DNA
sequence polymorphisms of a HKNG1 gene will exist within a
population of individual organisms (e.g., within a human
population). Such polymorphisms may exist, for example,
among individuals within a population due to natural allelic
variation. Such polymorphisms include ones that lead to
changes in amino acid sequence. An allele is one of a group
of genes which occur alternatively at a given genetic locus.

As used herein, the phrase "allelic variant" refers to a
nucleotide sequence which occurs at a given locus or to a
gene product encoded by that nucleotide sequence. Such
natural allelic variations can typically result in 1-5%
variance in the nucleotide sequence of a given gene.
Alternative alleles can be identified by sequencing the gene
of interest in a number of different individuals. This can
be readily carried out by using hybridization probes to
identify the same genetic locus in a variety of individuals.
As used herein, the terms "gene" and "recombinant gene" refer
to nucleic acid molecules comprising an open reading frame
encoding a polypeptide of the invention. The term can
further include nucleic acid molecules comprising upstream
and/or exon/intron sequences and structure.

With respect to HKNG1 allelic variants, any and all such
nucleotide variations and resulting amino acid polymorphisms
or variations that are the result of natural allelic
variation of the HKNG1 gene are intended to be within the
scope of the present invention. Such allelic variants
include, but are not limited to, ones that do not alter the
functional activity of the HKNG1 gene product. Variants
include, but are not limited to, variants comprising the
polymorphisms summarized in FIGS. 5A-B and a variant which
encodes a full length HKNG1 polypeptide comprising a
substitution of a lysine amino acid at amino acid residue 202
of the HKNG1 polypeptide shown in FIG. 1A-1B and SEQ ID NO: 2
or the HKNG1 amino acid residue 184 of the polypeptide of SEQ
ID NO: 4.

With respect to the cloning of additional allelic
variants of the human HKNG1 gene and homologues and orthologs
from other species (e.g., guinea pig, cow, mouse), the
isolated HKNG1 gene sequences disclosed herein may be labeled
and used to screen a cDNA library constructed from mRNA
obtained from appropriate cells or tissues (e.g., brain or
retinal tissues) derived from the organism (e.g., guinea pig,
cow, and mouse) of interest. The hybridization conditions
used should generally be of a lower stringency when the cDNA
library is derived from an organism different from the type
of organism from which the labeled sequence was derived, and
can routinely be determined based on, e.g., relative
relatedness of the target and reference organisms.

Alternatively, the labeled fragment may be used to
screen a genomic library derived from the organism of
interest, again, using appropriately stringent conditions.
Appropriate stringency conditions are well known to those of
skill in the art as discussed above, and will vary
predictably depending on the specific organisms from which
the library and the labeled sequences are derived. For
guidance regarding such conditions see, for example,
Sambrook, et al., 1989, Molecular Cloning, A Laboratory
Manual, Second Edition, Cold Spring Harbor Press, N.Y.; and
Ausubel, et al., 1989-1999, Current Protocols in Molecular
Biology, Green Publishing Associates and Wiley Interscience,
N.Y., both of which are incorporated herein by reference in
their entirety.

Further, a HKNG1 gene allelic variant may be isolated
from, for example, human nucleic acid, by performing PCR
using two degenerate oligonucleotide primer pools designed on
the basis of amino acid sequences within the HKNG1 gene
product disclosed herein. The template for the reaction may
be cDNA obtained by reverse transcription of mRNA prepared
from, for example, human or non-human cell lines or tissue
known or suspected to express a wild type or mutant HKNG1
gene allele (such as, for example, brain cells, including
brain cells from individuals having BAD). In one embodiment,
the allelic variant is isolated from an individual who has a
HKNG1-mediated disorder. Such variants are described in the
examples below.

The PCR product may be subcloned and sequenced to ensure
that the amplified sequences represent the sequences of a
HKNG1 gene nucleic acid sequence. The PCR fragment may then
be used to isolate a full length cDNA clone by a variety of
methods. For example, the amplified fragment may be labeled
and used to screen a bacteriophage cDNA library.
Alternatively, the labeled fragment may be used to isolate
genomic clones via the screening of a genomic library.

PCR technology may also be utilized to isolate full
length cDNA sequences. For example, RNA may be isolated,
following standard procedures, from an appropriate cellular
or tissue source (i.e., one known, or suspected, to express
the HKNG1 gene, such as, for example, brain tissue samples
obtained through biopsy or post-mortem). A reverse
transcription reaction may be performed on the RNA using an
oligonucleotide primer specific for the most 5' end of the
amplified fragment for the priming of first strand synthesis.
The resulting RNA/DNA hybrid may then be "tailed" with
guanines using a standard terminal transferase reaction, the
hybrid may be digested with RNAase H, and second strand
synthesis may then be primed with a poly-C primer. Thus,
cDNA sequences upstream of the amplified fragment may easily
be isolated. For a review of cloning strategies that may be
used, see e.g., Sambrook et al., 1989, supra, or Ausubel et
al., supra.

A cDNA of an allelic, e.g., mutant, variant of the HKNG1
gene may be isolated, for example, by using PCR, a technique
that is well known to those of skill in the art. In this
case, the first cDNA strand may be synthesized by hybridizing
an oligo-dT oligonucleotide to mRNA isolated from tissue
known or suspected to be expressed in an individual
putatively carrying a mutant HKNG1 allele, and by extending
the new strand with reverse transcriptase. The second strand
of the cDNA is then synthesized using an oligonucleotide that
hybridizes specifically to the 5' end of the normal gene.
Using these two primers, the product is then amplified via
PCR, cloned into a suitable vector, and subjected to DNA
sequence analysis through methods well known to those of
skill in the art. By comparing the DNA sequence of the
mutant HKNG1 allele to that of the normal HKNG1 allele, the
mutation(s) responsible for the loss or alteration of
function of the mutant HKNG1 gene product can be ascertained.
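
The comparison of a mutant allele sequence with the normal allele sequence can be sketched as a position-by-position scan. This is only a minimal illustration: it assumes the two sequences are already aligned and of equal length, so it reports substitutions but not insertions or deletions, and the function name is hypothetical.

```python
def find_substitutions(normal: str, mutant: str):
    """List (1-based position, normal base, mutant base) for each mismatch.

    Assumes both sequences are pre-aligned and of equal length; indels
    would need a real alignment step before this comparison.
    """
    if len(normal) != len(mutant):
        raise ValueError("sequences must be aligned to the same length")
    return [
        (position, n, m)
        for position, (n, m) in enumerate(
            zip(normal.upper(), mutant.upper()), start=1
        )
        if n != m
    ]
```

The toy inputs used to exercise such a function are arbitrary strings and are not taken from the HKNG1 sequences.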

Alternatively, a genomic library can be constructed
using DNA obtained from an individual suspected of or known
to carry a mutant HKNG1 allele, or a cDNA library can be
constructed using RNA from a tissue known, or suspected, to
express a mutant HKNG1 allele. An unimpaired HKNG1 gene or
any suitable fragment thereof may then be labeled and used as
a probe to identify the corresponding mutant HKNG1 allele in
such libraries. Clones containing the mutant HKNG1 gene
sequences may then be purified and subjected to sequence
analysis according to methods well known to those of skill in
the art.

Additionally, an expression library can be constructed
utilizing cDNA synthesized from, for example, RNA isolated
from a tissue known, or suspected, to express a mutant HKNG1
allele in an individual suspected of or known to carry such a
mutant allele. In this manner, gene products made by the
putatively mutant tissue may be expressed and screened using
standard antibody screening techniques in conjunction with
antibodies raised against the normal HKNG1 gene product, as
described, below, in Section 5.3. (For screening techniques,
see, for example, Harlow and Lane, eds., 1988, "Antibodies: A
Laboratory Manual", Cold Spring Harbor Press, Cold Spring
Harbor.)

In cases where a HKNG1 mutation results in an expressed
gene product with altered function (e.g., as a result of a
missense or a frameshift mutation), a polyclonal set of
anti-HKNG1 gene product antibodies is likely to cross-react
with the mutant HKNG1 gene product. Library clones detected
via their reaction with such labeled antibodies can be
purified and subjected to sequence analysis according to
methods well known to those of skill in the art.

HKNG1 mutations or polymorphisms can further be detected
using PCR amplification techniques. Primers can routinely be
designed to amplify overlapping regions of the whole HKNG1
sequence including the promoter regulating region. In one
embodiment, primers are designed to cover the exon-intron
boundaries such that coding regions can be scanned for
mutations. Exemplary primers for analyzing HKNG1 exons are
provided in Table 1, of Section 5.6, below.
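
One way to picture amplifying overlapping regions of a whole sequence is to tile it with windows that share a fixed overlap. The sketch below is purely illustrative: the amplicon size and overlap defaults are hypothetical values, not parameters from this specification, and the coordinates are 0-based, half-open intervals.

```python
def overlapping_amplicons(region_length: int, amplicon_size: int = 500,
                          overlap: int = 50):
    """Tile a region with (start, end) windows that overlap their neighbors.

    Coordinates are 0-based and half-open. The default amplicon_size and
    overlap are hypothetical values chosen only for illustration.
    """
    if region_length <= 0:
        raise ValueError("region_length must be positive")
    if overlap < 0 or amplicon_size <= overlap:
        raise ValueError("amplicon_size must exceed a non-negative overlap")
    windows = []
    start = 0
    step = amplicon_size - overlap
    while True:
        end = min(start + amplicon_size, region_length)
        windows.append((start, end))
        if end == region_length:
            break
        start += step
    return windows
```

Primer pairs would then be chosen at the ends of each window; primer selection itself (length, Tm, specificity) is outside the scope of this sketch.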

The invention also includes nucleic acid molecules, 
preferably DNA molecules, that are the complements of the 
nucleotide sequences of the preceding paragraphs. 

In certain embodiments, the nucleic acid molecules of
the invention are present as part of nucleic acid molecules
comprising nucleic acid sequences that do not contain
heterologous (e.g., cloning vector or expression vector)
sequences. In other embodiments, the nucleic acid molecules
of the invention further comprise vector sequences, e.g.,
cloning vectors or expression vectors.

5.2. PROTEIN PRODUCTS OF THE HKNG1 GENE

HKNG1 gene products or peptide fragments thereof, can be
prepared for a variety of uses. For example, such gene
products, or peptide fragments thereof, can be used for the
generation of antibodies, in diagnostic assays, or for the
identification of other cellular or extracellular gene
products involved in the regulation of HKNG1-mediated
disorders, e.g., neuropsychiatric disorders, such as BAD.

The HKNG1 gene products of the invention include, but
are not limited to, human HKNG1 gene products, e.g.,
polypeptides comprising the amino acid sequences depicted in
FIGS. 1A-1B, 2A-2B, 17, and 18 (i.e., SEQ ID NOS: 2, 4, 51,
and 66). The HKNG1 gene products of the invention also
include non-human, e.g., mammalian (such as bovine or guinea
pig), HKNG1 gene products. These include, but are not
limited to, polypeptides comprising the amino acid sequences
depicted in FIGS. 7-13 (i.e., SEQ ID NOS: 39, 41, 43, 45, and
49).

HKNG1 gene product, sometimes referred to herein as an
"HKNG1 protein" or "HKNG1 polypeptide," includes those gene
products encoded by the HKNG1 gene sequences depicted in
FIGS. 1A-1B, 2A-2B, 7-13, 17, and 18, as well as gene
products encoded by other human allelic variants and
non-human variants of HKNG1 that can be identified by the
methods herein described. Among such HKNG1 gene product
variants are gene products comprising HKNG1 amino acid
residues encoded by the polymorphisms depicted in FIGS. 5A
and 5B. Such gene product variants also include a variant of
the HKNG1 gene product depicted in FIG. 1 (SEQ ID NO: 2)
wherein the amino acid residue Lys202 is mutated to a
glutamic acid residue. Such HKNG1 gene product variants also
include a variant of the HKNG1 gene product depicted in
FIG. 2 (SEQ ID NO: 4) wherein the amino acid residue Lys184
is mutated to a glutamic acid residue.

In addition, HKNG1 gene products may include proteins
that represent functionally equivalent gene products.
Functionally equivalent gene products may include, for
example, gene products encoded by one of the HKNG1 nucleic
acid molecules described in Section 5.1, above. In preferred
embodiments, such functionally equivalent HKNG1 gene products
are naturally occurring gene products. Functionally
equivalent HKNG1 gene products also include gene products
that retain at least one of the biological activities of the
HKNG1 gene products described above, and/or which are
recognized by and bind to antibodies (polyclonal or
monoclonal) directed against HKNG1 gene products.

Equivalent HKNG1 gene products may contain deletions,
including internal deletions, additions, including additions
yielding fusion proteins, or substitutions of amino acid
residues within and/or adjacent to the amino acid sequence
encoded by the HKNG1 gene sequences described, above, in
Section 5.1. Generally, deletions will be deletions of
single amino acid residues, or deletions of no more than
about 2, 3, 4, 5, 10 or 20 amino acid residues, either
contiguous or non-contiguous. Generally, additions or
substitutions, other than additions that yield fusion
proteins, will be additions or substitutions of single amino
acid residues, or additions or substitutions of no more than
about 2, 3, 4, 5, 10 or 20 amino acid residues, either
contiguous or non-contiguous. Preferably, these
modifications result in a "silent" change, in that the change
produces a HKNG1 gene product with the same activity as the
HKNG1 gene product depicted in FIG. 1A-1B, 2A-2B, 7-13, or
17.

Amino acid substitutions may be made on the basis of
similarity in polarity, charge, solubility, hydrophobicity,
hydrophilicity, and/or the amphipathic nature of the residues
involved. For example, nonpolar (hydrophobic) amino acids
include alanine, leucine, isoleucine, valine, proline,
phenylalanine, tryptophan, and methionine; polar neutral
amino acids include glycine, serine, threonine, cysteine,
tyrosine, asparagine, and glutamine; positively charged
(basic) amino acids include arginine, lysine, and histidine;
and negatively charged (acidic) amino acids include aspartic
acid and glutamic acid.
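
The four residue classes listed above can be written as a lookup table keyed by one-letter amino acid code. In the sketch below, treating a substitution as "conservative" when both residues fall in the same class is a simplifying assumption made for illustration; the names are hypothetical.

```python
# Residue classes as listed in the text, keyed by one-letter code.
AMINO_ACID_CLASS = {
    # nonpolar (hydrophobic)
    "A": "nonpolar", "L": "nonpolar", "I": "nonpolar", "V": "nonpolar",
    "P": "nonpolar", "F": "nonpolar", "W": "nonpolar", "M": "nonpolar",
    # polar neutral
    "G": "polar_neutral", "S": "polar_neutral", "T": "polar_neutral",
    "C": "polar_neutral", "Y": "polar_neutral", "N": "polar_neutral",
    "Q": "polar_neutral",
    # positively charged (basic)
    "R": "basic", "K": "basic", "H": "basic",
    # negatively charged (acidic)
    "D": "acidic", "E": "acidic",
}


def is_conservative(original: str, replacement: str) -> bool:
    """True when both residues belong to the same class in the table.

    Assumption: 'conservative' is approximated as 'same class'.
    """
    return (AMINO_ACID_CLASS[original.upper()]
            == AMINO_ACID_CLASS[replacement.upper()])
```

Under this approximation a lysine-to-arginine change stays within the basic class, whereas a change between lysine and glutamic acid crosses from the basic to the acidic class.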

Alternatively, where alteration of function is desired,
addition(s), deletion(s) or non-conservative alterations can
produce altered, including reduced-activity, HKNG1 gene
products. Such alterations can, for example, alter one or
more of the biological functions of the HKNG1 gene product.
Further, such alterations can be selected so as to generate
HKNG1 gene products that are better suited for expression,
scale up, etc. in the host cells chosen. For example,
cysteine residues can be deleted or substituted with another
amino acid residue in order to eliminate disulfide bridges.
As another example, altered HKNG1 gene products can be
engineered that correspond to variants of the HKNG1 gene
product associated with HKNG1-mediated neuropsychiatric
disorders such as BAD. Such altered HKNG1 gene products
include, but are not limited to, HKNG1 proteins or peptides
comprising substitution of a lysine residue for the wild-type
glutamic acid residue at HKNG1 amino acid position 202 in
FIG. 1A-1B (SEQ ID NO: 2) or amino acid position 184 (SEQ ID
NO: 4) in FIG. 2A-2B.

HKNG1 protein fragments and/or HKNG1 peptides comprise
at least as many contiguous amino acid residues as necessary
to represent an epitope fragment (that is to be recognized by
an antibody directed to the HKNG1 protein). For example,
such protein fragments or peptides comprise at least about 8
contiguous HKNG1 amino acid residues from a full length HKNG1
protein. In alternate embodiments, the HKNG1 protein
fragments and peptides of the invention can comprise about
10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300,
350, 400, 450 or more contiguous amino acid residues of a
HKNG1 protein.

Peptides and/or proteins corresponding to one or more
domains of the HKNG1 protein as well as fusion proteins in
which a HKNG1 protein, or a portion of a HKNG1 protein such
as a truncated HKNG1 protein or peptide or a HKNG1 protein
domain, is fused to an unrelated protein are also within the
scope of this invention. Such proteins and peptides can be
designed on the basis of the HKNG1 nucleotide sequence
disclosed in Section 5.1, above, and/or on the basis of the
HKNG1 amino acid sequence disclosed in this Section. Fusion
proteins include, but are not limited to, IgFc fusions which
stabilize the HKNG1 protein or peptide and prolong half life
in vivo; or fusions to any amino acid sequence that allows
the fusion protein to be anchored to the cell membrane; or
fusions to an enzyme, fluorescent protein, luminescent
protein, or a flag epitope protein or peptide which provides
a marker function.

The HKNG1 protein sequences described above can include a
domain which comprises a signal sequence that targets the
HKNG1 gene product for secretion. As used herein, a signal
sequence includes a peptide of at least about 15 or 20 amino
acid residues in length which occurs at the N-terminus of
secretory and membrane-bound proteins and which contains at
least about 70% hydrophobic amino acid residues such as
alanine, leucine, isoleucine, phenylalanine, proline,
tyrosine, tryptophan, or valine. In a preferred embodiment,
a signal sequence contains at least about 10 to 40 amino acid
residues, preferably about 19-34 amino acid residues, and has
at least about 60-80%, more preferably 65-75%, and more
preferably at least about 70% hydrophobic residues. A signal
sequence serves to direct a protein containing such a
sequence to a lipid bilayer.

In one embodiment, a HKNG1 protein contains a signal
sequence at about amino acids 1 to 49 of SEQ ID NO: 2. In
another embodiment, a HKNG1 protein contains a signal
sequence at about amino acids 30-49 of SEQ ID NO: 2. In yet
another embodiment, a HKNG1 protein contains a signal
sequence at about amino acid residues 1 to 31 of SEQ ID NO: 4.
In yet another embodiment, a HKNG1 protein contains a signal
sequence at about amino acids 12-31 of SEQ ID NO: 4. The
signal sequence is cleaved during processing of the mature
protein.
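
By way of illustration only, the length and hydrophobicity
criteria recited above for a signal sequence can be sketched
as a simple check. The function names, the default thresholds
(15 residues, 70%) and the example peptide are assumptions
chosen for this sketch; the peptide is hypothetical and is
not taken from any SEQ ID NO of this disclosure.

```python
# Hydrophobic residues as listed in the text, in one-letter code:
# alanine, leucine, isoleucine, phenylalanine, proline,
# tyrosine, tryptophan, valine.
HYDROPHOBIC = set("ALIFPYWV")


def hydrophobic_fraction(peptide: str) -> float:
    """Return the fraction of residues that are in HYDROPHOBIC."""
    peptide = peptide.upper()
    if not peptide:
        raise ValueError("peptide must not be empty")
    return sum(res in HYDROPHOBIC for res in peptide) / len(peptide)


def looks_like_signal_sequence(peptide: str,
                               min_len: int = 15,
                               min_fraction: float = 0.70) -> bool:
    """Apply the 'at least about 15 residues, at least about 70%
    hydrophobic' criteria; both thresholds are adjustable."""
    return (len(peptide) >= min_len
            and hydrophobic_fraction(peptide) >= min_fraction)


# Hypothetical 16-residue N-terminal peptide (illustration only).
example_peptide = "MALLVLLAAVFLPLAW"
example_is_signal = looks_like_signal_sequence(example_peptide)
```

Such a check screens only for composition and length; it does
not predict a cleavage site.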

A signal sequence of a polypeptide of the invention can
be used to facilitate secretion and isolation of the secreted
protein or other proteins of interest. Signal sequences are
typically characterized by a core of hydrophobic amino acids
which are generally cleaved from the mature protein during
secretion in one or more cleavage events. Such signal
peptides contain processing sites that allow cleavage of the
signal sequence from the mature proteins as they pass through
the secretory pathway. Thus, the invention pertains to the
described HKNG1 polypeptides having a signal sequence (that
is, "immature" polypeptides), as well as to the HKNG1 signal
sequences themselves and to the HKNG1 polypeptides in the
absence of a signal sequence (i.e., the "mature" HKNG1
cleavage products). It is to be understood that HKNG1
polypeptides of the invention can further comprise
polypeptides comprising any signal sequence having
characteristics as described above and a mature HKNG1
polypeptide sequence.

In one embodiment, a nucleic acid sequence encoding a
signal sequence of the invention can be operably linked in an
expression vector to a protein of interest, such as a protein
which is ordinarily not secreted or is otherwise difficult to
isolate. The signal sequence directs secretion of the
protein, such as from a eukaryotic host into which the
expression vector is transformed, and the signal sequence is
subsequently or concurrently cleaved. The protein can then
be readily purified from the extracellular medium by art
recognized methods. Alternatively, the signal sequence can
be linked to the protein of interest using a sequence which
facilitates purification, such as with a GST domain.

The HKNG1 protein sequences described above can also
include one or more domains which comprise a clusterin
domain, i.e., domains which are identical to or substantially
homologous to (i.e., 65%, 75%, 80%, 85%, 90%, 95% or more
identical to) the domain corresponding to amino acid residues
134 to 160 or amino acid residues 334 to 362 of SEQ ID NO: 2,
or to the domain corresponding to amino acid residues 105-131
or amino acid residues 305-333 of SEQ ID NO: 39, or to the
domain corresponding to amino acid residues 105-131 or amino
acid residues 304-332 of SEQ ID NO: 49. Preferably, such
domains comprise cysteine amino acid residues at positions
corresponding to conserved cysteine residues of the clusterin
domains of SEQ ID NOS: 2, 39 or 49.
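
As a non-limiting sketch, the percent-identity thresholds
recited above can be evaluated for two domain sequences that
have already been aligned to equal length without gaps. That
pre-alignment, the function names and the 65% default are
assumptions of this sketch; in practice an alignment program
would be used to produce the aligned segments.

```python
def percent_identity(candidate: str, reference: str) -> float:
    """Percent of aligned positions carrying the same residue.

    Assumes both segments are pre-aligned, gap-free and of
    equal length (an assumption of this sketch).
    """
    if not reference or len(candidate) != len(reference):
        raise ValueError("segments must be non-empty and equal length")
    matches = sum(a == b for a, b in zip(candidate.upper(),
                                         reference.upper()))
    return 100.0 * matches / len(reference)


def is_substantially_homologous(candidate: str, reference: str,
                                threshold: float = 65.0) -> bool:
    """True when identity meets the chosen threshold (65% here;
    75, 80, 85, 90 or 95 may be passed instead)."""
    return percent_identity(candidate, reference) >= threshold
```

For example, two 20-residue segments differing at 3 positions
are 85% identical and satisfy the 65%, 75%, 80% and 85%
thresholds but not the 90% threshold.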

In particular, HKNG1 protein sequences described above
can also include one or more domains which comprise a
conserved cysteine domain. Such a domain corresponds, for
example, to the domain of cysteines corresponding to Cys134,
Cys145, Cys148, Cys158 and Cys160; or to Cys334, Cys344,
Cys351, Cys354, and Cys362 of SEQ ID NO: 2. In an alternative
embodiment, a conserved cysteine domain corresponds to one or
more of the domains of SEQ ID NO: 39 which comprises Cys105,
Cys116, Cys119, Cys124, and Cys131; or Cys305, Cys315,
Cys322, Cys325, and Cys333. In yet another alternative
embodiment, a conserved cysteine domain corresponds to one or
more of the domains of SEQ ID NO: 49 which comprises Cys105,
Cys115, Cys119, Cys124, and Cys131; or Cys314, Cys321,
Cys324, and Cys332.
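
Purely as an illustrative sketch, whether a given amino acid
sequence carries cysteine residues at a stated set of
positions can be checked as follows. Positions are 1-based,
as in the residue numbering used above; the function name and
the toy sequence are hypothetical and do not reproduce any
SEQ ID NO of this disclosure.

```python
def has_cysteines_at(sequence: str, positions) -> bool:
    """Return True if every 1-based position lies within the
    sequence and holds a cysteine ('C')."""
    sequence = sequence.upper()
    return all(1 <= pos <= len(sequence) and sequence[pos - 1] == "C"
               for pos in positions)


# Toy 12-residue sequence with cysteines at positions 2, 5 and 9.
toy_sequence = "ACGGCAAACGGG"
toy_matches = has_cysteines_at(toy_sequence, [2, 5, 9])
```

With a full-length sequence, the position lists recited above
(for example 134, 145, 148, 158 and 160 for SEQ ID NO: 2)
could be passed in the same way.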

Finally, the HKNG1 proteins of the invention also
include HKNG1 protein sequences wherein domains encoded by
one or more exons of the cDNA sequence, or fragments thereof,
have been deleted. In one particularly preferred embodiment,
the HKNG1 proteins of the invention are proteins in which the
domain(s) corresponding to those domains encoded by exon 7
of SEQ ID NO: 7, or fragments thereof, have been deleted.

The HKNG1 polypeptides of the invention can further
comprise posttranslational modifications, including, but not
limited to, glycosylations, acetylations, and
myristoylations.

The HKNG1 gene products, peptide fragments thereof and
fusion proteins thereof, may be produced by recombinant DNA
technology using techniques well known in the art. Thus,
methods for preparing the HKNG1 gene products, polypeptides,
peptides, fusion peptides and fusion polypeptides of the
invention by expressing nucleic acid containing HKNG1 gene
sequences are described herein. Methods that are well known
to those skilled in the art can be used to construct
expression vectors containing HKNG1 gene product coding
sequences and appropriate transcriptional and translational
control signals. These methods include, for example, in
vitro recombinant DNA techniques, synthetic techniques, and
in vivo genetic recombination. See, for example, the
techniques described in Sambrook, et al., 1989, supra, and
Ausubel, et al., 1989, supra. Alternatively, RNA capable of
encoding HKNG1 gene product sequences may be chemically
synthesized using, for example, synthesizers. See, for
example, the techniques described in "Oligonucleotide
Synthesis", 1984, Gait, ed., IRL Press, Oxford.

A variety of host-expression vector systems may be
utilized to express the HKNG1 gene product coding sequences
of the invention. Such host-expression systems represent
vehicles by which the coding sequences of interest may be
produced and subsequently purified, but also represent cells
that may, when transformed or transfected with the
appropriate nucleotide coding sequences, exhibit the HKNG1
gene product of the invention in situ. These include but are
not limited to microorganisms such as bacteria (e.g., E.
coli, B. subtilis) transformed with recombinant bacteriophage
DNA, plasmid DNA or cosmid DNA expression vectors containing
HKNG1 gene product coding sequences; yeast (e.g.,
Saccharomyces, Pichia) transformed with recombinant yeast
expression vectors containing the HKNG1 gene product coding
sequences; insect cell systems infected with recombinant
virus expression vectors (e.g., baculovirus) containing the
HKNG1 gene product coding sequences; plant cell systems
infected with recombinant virus expression vectors (e.g.,
cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) or
transformed with recombinant plasmid expression vectors
(e.g., Ti plasmid) containing HKNG1 gene product coding
sequences; or mammalian cell systems (e.g., COS, CHO, BHK,
293, 3T3) harboring recombinant expression constructs
containing promoters derived from the genome of mammalian
cells (e.g., metallothionein promoter) or from mammalian
viruses (e.g., the adenovirus late promoter; the vaccinia
virus 7.5K promoter).

In bacterial systems, a number of expression vectors may
be advantageously selected depending upon the use intended
for the HKNG1 gene product being expressed. For example,
when a large quantity of such a protein is to be produced,
for the generation of pharmaceutical compositions of HKNG1
gene product or for raising antibodies to HKNG1 gene product,
for example, vectors that direct the expression of high
levels of fusion protein products that are readily purified
may be desirable. Such vectors include, but are not limited
to, the E. coli expression vector pUR278 (Ruther et al., 1983,
EMBO J. 2:1791), in which the HKNG1 gene product coding
sequence may be ligated individually into the vector in frame
with the lacZ coding region so that a fusion protein is
produced; pIN vectors (Inouye and Inouye, 1985, Nucleic Acids
Res. 13:3101-3109; Van Heeke and Schuster, 1989, J. Biol.
Chem. 264:5503-5509); and the like. pGEX vectors may also be
used to express foreign polypeptides as fusion proteins with
glutathione S-transferase (GST). In general, such fusion
proteins are soluble and can easily be purified from lysed
cells by adsorption to glutathione-agarose beads followed by
elution in the presence of free glutathione. The pGEX
vectors are designed to include thrombin or factor Xa
protease cleavage sites so that the cloned target gene
product can be released from the GST moiety.

In an insect system, Autographa californica nuclear
polyhedrosis virus (AcNPV) is used as a vector to express
foreign genes. The virus grows in Spodoptera frugiperda
cells. The HKNG1 gene product coding sequence may be cloned
individually into non-essential regions (for example the
polyhedrin gene) of the virus and placed under control of an
AcNPV promoter (for example the polyhedrin promoter).
Successful insertion of HKNG1 gene product coding sequence
will result in inactivation of the polyhedrin gene and
production of non-occluded recombinant virus (i.e., virus
lacking the proteinaceous coat coded for by the polyhedrin
gene). These recombinant viruses are then used to infect
Spodoptera frugiperda cells in which the inserted gene is
expressed (e.g., see Smith, et al., 1983, J. Virol. 46:584;
Smith, U.S. Patent No. 4,215,051).

In mammalian host cells, a number of viral-based
expression systems may be utilized. In cases where an
adenovirus is used as an expression vector, the HKNG1 gene
product coding sequence of interest may be ligated to an
adenovirus transcription/translation control complex, e.g.,
the late promoter and tripartite leader sequence. This
chimeric gene may then be inserted in the adenovirus genome
by in vitro or in vivo recombination. Insertion in a
non-essential region of the viral genome (e.g., region E1 or
E3) will result in a recombinant virus that is viable and
capable of expressing HKNG1 gene product in infected hosts
(see, e.g., Logan and Shenk, 1984, Proc. Natl. Acad. Sci. USA
81:3655-3659). Specific initiation signals may also be
required for efficient translation of inserted HKNG1 gene
product coding sequences. These signals include the ATG
initiation codon and adjacent sequences. In cases where an
entire HKNG1 gene, including its own initiation codon and
adjacent sequences, is inserted into the appropriate
expression vector, no additional translational control
signals may be needed. However, in cases where only a
portion of the HKNG1 gene coding sequence is inserted,
exogenous translational control signals, including, perhaps,
the ATG initiation codon, must be provided. Furthermore, the
initiation codon must be in phase with the reading frame of
the desired coding sequence to ensure translation of the
entire insert. These exogenous translational control signals
and initiation codons can be of a variety of origins, both
natural and synthetic. The efficiency of expression may be
enhanced by the inclusion of appropriate transcription
enhancer elements, transcription terminators, etc. (see
Bittner, et al., 1987, Methods in Enzymol. 153:516-544).
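
The requirement that an exogenous initiation codon be in
phase with the reading frame of the inserted coding sequence
can be illustrated, by way of example only, with the
following sketch. The construct sequence, the index
conventions (0-based) and the function name are hypothetical
and are introduced solely for this illustration.

```python
def atg_in_frame(construct: str, atg_index: int,
                 insert_start: int) -> bool:
    """Return True when the first base of the insert's first codon
    lies a whole number of codons downstream of the ATG.

    atg_index and insert_start are 0-based offsets into construct.
    """
    if construct[atg_index:atg_index + 3].upper() != "ATG":
        raise ValueError("no ATG at atg_index")
    if insert_start < atg_index + 3:
        raise ValueError("insert must start downstream of the ATG")
    return (insert_start - atg_index) % 3 == 0


# Hypothetical construct: leader, ATG, 6-base linker, insert.
construct = "GCCACC" + "ATG" + "GGATCC" + "GCTCTGTAA"
atg_index = construct.find("ATG")   # 6
insert_start = 6 + 3 + 6            # 15
in_frame = atg_in_frame(construct, atg_index, insert_start)
```

A linker whose length is not a multiple of three would shift
the insert out of phase, and the same check would then return
False.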
In addition, a host cell strain may be chosen that
modulates the expression of the inserted sequences, or
modifies and processes the gene product in the specific
fashion desired. Such modifications (e.g., glycosylation)
and processing (e.g., cleavage) of protein products may be
important for the function of the protein. Different host
cells have characteristic and specific mechanisms for the
post-translational processing and modification of proteins
and gene products. Appropriate cell lines or host systems
can be chosen to ensure the correct modification and
processing of the foreign protein expressed. To this end,
eukaryotic host cells that possess the cellular machinery for
proper processing of the primary transcript, glycosylation,
and phosphorylation of the gene product may be used. Such
mammalian host cells include but are not limited to CHO,
VERO, BHK, HeLa, COS, MDCK, 293, 3T3, and WI38.

For long-term, high-yield production of recombinant
proteins, stable expression is preferred. For example, cell
lines that stably express the HKNG1 gene product may be
engineered. Rather than using expression vectors that
contain viral origins of replication, host cells can be
transformed with DNA controlled by appropriate expression
control elements (e.g., promoter, enhancer sequences,
transcription terminators, polyadenylation sites, etc.), and
a selectable marker. Following the introduction of the
foreign DNA, engineered cells may be allowed to grow for 1-2
days in an enriched media, and then are switched to a
selective media. The selectable marker in the recombinant
plasmid confers resistance to the selection and allows cells
to stably integrate the plasmid into their chromosomes and
grow to form foci that in turn can be cloned and expanded
into cell lines. This method may advantageously be used to
engineer cell lines that express the HKNG1 gene product.
Such engineered cell lines may be particularly useful in
screening and evaluation of compounds that affect the
endogenous activity of the HKNG1 gene product.
A number of selection systems may be used, including but
not limited to the herpes simplex virus thymidine kinase
(Wigler, et al., 1977, Cell 11:223), hypoxanthine-guanine
phosphoribosyltransferase (Szybalska and Szybalski, 1962,
Proc. Natl. Acad. Sci. USA 48:2026), and adenine
phosphoribosyltransferase (Lowy, et al., 1980, Cell 22:817)
genes, which can be employed in tk-, hgprt- or aprt- cells,
respectively. Also, antimetabolite resistance can be used as
the basis of selection for the following genes: dhfr, which
confers resistance to methotrexate (Wigler, et al., 1980,
Proc. Natl. Acad. Sci. USA 77:3567; O'Hare, et al., 1981,
Proc. Natl. Acad. Sci. USA 78:1527); gpt, which confers
resistance to mycophenolic acid (Mulligan and Berg, 1981,
Proc. Natl. Acad. Sci. USA 78:2072); neo, which confers
resistance to the aminoglycoside G-418 (Colberre-Garapin, et
al., 1981, J. Mol. Biol. 150:1); and hygro, which confers
resistance to hygromycin (Santerre, et al., 1984, Gene
30:147).

Alternatively, the expression characteristics of an
endogenous HKNG1 gene within a cell line or microorganism may
be modified by inserting a heterologous DNA regulatory
element into the genome of a stable cell line or cloned
microorganism such that the inserted regulatory element is
operatively linked with the endogenous HKNG1 gene. For
example, an endogenous HKNG1 gene which is normally
"transcriptionally silent", i.e., an HKNG1 gene which is
normally not expressed, or is expressed only at very low
levels in a cell line or microorganism, may be activated by
inserting a regulatory element which is capable of promoting
the expression of a normally expressed gene product in that
cell line or microorganism. Alternatively, a
transcriptionally silent, endogenous HKNG1 gene may be
activated by insertion of a promiscuous regulatory element
that works across cell types.

A heterologous regulatory element may be inserted into a
stable cell line or cloned microorganism, such that it is
operatively linked with an endogenous HKNG1 gene, using
techniques, such as targeted homologous recombination, which
are well known to those of skill in the art, and described,
e.g., in Chappel, U.S. Patent No. 5,272,071; PCT publication
No. WO 91/06667, published May 16, 1991.
Alternatively, any fusion protein may be readily
purified by utilizing an antibody specific for the fusion
protein being expressed. For example, a system described by
Janknecht, et al. allows for the ready purification of
non-denatured fusion proteins expressed in human cell lines
(Janknecht, et al., 1991, Proc. Natl. Acad. Sci. USA
88:8972-8976). In this system, the gene of interest is
subcloned into a vaccinia recombination plasmid such that the
gene's open reading frame is translationally fused to an
amino-terminal tag consisting of six histidine residues.
Extracts from cells infected with recombinant vaccinia virus
are loaded onto Ni2+-nitriloacetic acid-agarose columns and
histidine-tagged proteins are selectively eluted with
imidazole-containing buffers.

The HKNG1 gene products can also be expressed in
transgenic animals. Animals of any species, including, but
not limited to, mice, rats, rabbits, guinea pigs, pigs,
micro-pigs, goats, sheep, cows, and non-human primates, e.g.,
baboons, monkeys, and chimpanzees may be used to generate
HKNG1 transgenic animals. The term "transgenic," as used
herein, refers to animals expressing HKNG1 gene sequences
from a different species (e.g., mice expressing human HKNG1
gene sequences), as well as animals that have been
genetically engineered to overexpress endogenous (i.e., same
species) HKNG1 sequences or animals that have been
genetically engineered to no longer express endogenous HKNG1
gene sequences (i.e., "knock-out" animals), and their
progeny.

Any technique known in the art may be used to introduce
a HKNG1 gene transgene into animals to produce the founder
lines of transgenic animals. Such techniques include, but
are not limited to pronuclear microinjection (Hoppe and
Wagner, 1989, U.S. Pat. No. 4,873,191); retrovirus mediated
gene transfer into germ lines (Van der Putten, et al., 1985,
Proc. Natl. Acad. Sci., USA 82:6148-6152); gene targeting in
embryonic stem cells (Thompson, et al., 1989, Cell
56:313-321); electroporation of embryos (Lo, 1983, Mol. Cell.
Biol. 3:1803-1814); and sperm-mediated gene transfer
(Lavitrano et al., 1989, Cell 57:717-723). (For a review of
such techniques, see Gordon, 1989, Transgenic Animals, Intl.
Rev. Cytol. 115:171-229.)

Any technique known in the art may be used to produce
transgenic animal clones containing a HKNG1 transgene, for
example, nuclear transfer into enucleated oocytes of nuclei
from cultured embryonic, fetal or adult cells induced to
quiescence (Campbell, et al., 1996, Nature 380:64-66; Wilmut,
et al., Nature 385:810-813).

The present invention provides for transgenic animals
that carry a HKNG1 transgene in all their cells, as well as
animals that carry the transgene in some, but not all their
cells, i.e., mosaic animals. The transgene may be integrated
as a single transgene or in concatamers, e.g., head-to-head
tandems or head-to-tail tandems. The transgene may also be
selectively introduced into and activated in a particular
cell type by following, for example, the teaching of Lasko et
al. (Lasko, et al., 1992, Proc. Natl. Acad. Sci. USA
89:6232-6236). The regulatory sequences required for such a
cell-type specific activation will depend upon the particular
cell type of interest, and will be apparent to those of skill
in the art. When it is desired that the HKNG1 transgene be
integrated into the chromosomal site of the endogenous HKNG1
gene, gene targeting is preferred. Briefly, when such a
technique is to be utilized, vectors containing some
nucleotide sequences homologous to the endogenous HKNG1 gene
are designed for the purpose of integrating, via homologous
recombination with chromosomal sequences, into and disrupting
the function of the nucleotide sequence of the endogenous
HKNG1 gene. The transgene may also be selectively introduced
into a particular cell type, thus inactivating the endogenous
HKNG1 gene in only that cell type, by following, for example,
the teaching of Gu, et al. (Gu, et al., 1994, Science
265:103-106). The regulatory sequences required for such a
cell-type specific inactivation will depend upon the
particular cell type of interest, and will be apparent to
those of skill in the art.

Once transgenic animals have been generated, the
expression of the recombinant HKNG1 gene may be assayed
utilizing standard techniques. Initial screening may be
accomplished by Southern blot analysis or PCR techniques to
analyze animal tissues to assay whether integration of the
transgene has taken place. The level of mRNA expression of
the transgene in the tissues of the transgenic animals may
also be assessed using techniques that include but are not
limited to Northern blot analysis of tissue samples obtained
from the animal, in situ hybridization analysis, and RT-PCR
(reverse transcriptase PCR). Samples of HKNG1 gene-expressing
tissue may also be evaluated immunocytochemically
using antibodies specific for the HKNG1 transgene product.

HKNG1 proteins can be used, e.g., to treat CNS-related
disorders, e.g., neuropsychiatric disorders. Such HKNG1 gene
products include but are not limited to soluble derivatives
such as peptides or polypeptides corresponding to one or more
domains of the HKNG1 gene product, particularly HKNG1 gene
products that are modified such that they are deleted for
one or more hydrophobic domains. Alternatively, antibodies
to the HKNG1 protein or anti-idiotypic antibodies that mimic
the HKNG1 gene product (including Fab fragments), antagonists
or agonists can be used to treat neuropsychiatric disorders
involving HKNG1. In yet another approach, nucleotide
constructs encoding such HKNG1 gene products can be used to
genetically engineer host cells to express such HKNG1 gene
products in vivo; these genetically engineered cells can
function as "bioreactors" in the body delivering a continuous
supply of HKNG1 gene product, HKNG1 peptides, or soluble
HKNG1 polypeptides.

5.3. ANTIBODIES TO HKNG1 GENE PRODUCTS

Described herein are methods for the production of
antibodies capable of specifically recognizing one or more
HKNG1 gene product epitopes or epitopes of conserved variants
or peptide fragments of the HKNG1 gene products. Further,
antibodies that specifically recognize mutant forms of HKNG1
are encompassed by the invention. The terms "specifically
bind" and "specifically recognize" refer to antibodies that
bind to HKNG1 gene product epitopes at a higher affinity than
they bind to non-HKNG1 (e.g., random) epitopes.

Such antibodies may include, but are not limited to,
polyclonal antibodies, monoclonal antibodies (mAbs),
humanized or chimeric antibodies, single chain antibodies,
Fab fragments, F(ab')2 fragments, fragments produced by a Fab
expression library, anti-idiotypic (anti-Id) antibodies, and
epitope-binding fragments of any of the above, including the
polyclonal and monoclonal antibodies described in Section 12
below. Such antibodies may be used, for example, in the
detection of a HKNG1 gene product in a biological sample and
may, therefore, be utilized as part of a diagnostic or
prognostic technique whereby patients may be tested for
abnormal levels of HKNG1 gene products, and/or for the
presence of abnormal forms of such gene products. Such
antibodies may also be utilized in conjunction with, for
example, compound screening schemes, as described, below, in
Section 5.8, for the evaluation of the effect of test
compounds on HKNG1 gene product levels and/or activity.
Additionally, such antibodies can be used in conjunction with
the gene therapy techniques described, below, in Section
5.9.2 to, for example, evaluate the normal and/or engineered
HKNG1-expressing cells prior to their introduction into the
patient.

Anti-HKNG1 gene product antibodies may additionally be
used in methods for inhibiting abnormal HKNG1 gene product
activity. Thus, such antibodies may be utilized as part of
treatment methods for a HKNG1-mediated neuropsychiatric
disorder, such as BAD or schizophrenia.

For the production of antibodies against a HKNG1 gene
product, various host animals may be immunized by injection
with a HKNG1 gene product, or a portion thereof. Such host
animals may include, but are not limited to rabbits, mice,
and rats, to name but a few. Various adjuvants may be used
to increase the immunological response, depending on the host
species, including but not limited to Freund's (complete and
incomplete), mineral gels such as aluminum hydroxide, surface
active substances such as lysolecithin, pluronic polyols,
polyanions, peptides, oil emulsions, keyhole limpet
hemocyanin, dinitrophenol, and potentially useful human
adjuvants such as BCG (bacille Calmette-Guerin) and
Corynebacterium parvum.

Polyclonal antibodies are heterogeneous populations of
antibody molecules derived from the sera of animals immunized
with an antigen, such as a HKNG1 gene product, or an
antigenic functional derivative thereof. For the production
of polyclonal antibodies, host animals such as those
described above may be immunized by injection with HKNG1
gene product supplemented with adjuvants as also described
above.

Monoclonal antibodies, which are homogeneous populations
of antibodies to a particular antigen, may be obtained by any
technique that provides for the production of antibody
molecules by continuous cell lines in culture. These
include, but are not limited to, the hybridoma technique of
Kohler and Milstein (1975, Nature 256:495-497; and U.S.
Patent No. 4,376,110), the human B-cell hybridoma technique
(Kosbor et al., 1983, Immunology Today 4:72; Cole et al.,
1983, Proc. Natl. Acad. Sci. USA 80:2026-2030), and the
EBV-hybridoma technique (Cole et al., 1985, Monoclonal
Antibodies And Cancer Therapy, Alan R. Liss, Inc., pp. 77-96).
Such antibodies may be of any immunoglobulin class including
IgG, IgM, IgE, IgA, IgD and any subclass thereof. The
hybridoma producing the mAb of this invention may be
cultivated in vitro or in vivo. Production of high titers of
mAbs in vivo makes this the presently preferred method of
production.

In addition, techniques developed for the production of
"chimeric antibodies" (Morrison, et al., 1984, Proc. Natl.
Acad. Sci., 81:6851-6855; Neuberger, et al., 1984, Nature
312:604-608; Takeda, et al., 1985, Nature, 314:452-454) by
splicing the genes from a mouse antibody molecule of
appropriate antigen specificity together with genes from a
human antibody molecule of appropriate biological activity
can be used. A chimeric antibody is a molecule in which
different portions are derived from different animal species,
such as those having a variable region derived from a murine
mAb and a human immunoglobulin constant region. (See, e.g.,
Cabilly et al., U.S. Patent No. 4,816,567; and Boss et al.,
U.S. Patent No. 4,816,397, which are incorporated herein by
reference in their entirety.)
In addition, techniques have been developed for the
production of humanized antibodies. (See, e.g., Queen, U.S.
Patent No. 5,585,089, which is incorporated herein by
reference in its entirety.) An immunoglobulin light or heavy
chain variable region consists of a "framework" region
interrupted by three hypervariable regions, referred to as
complementarity determining regions (CDRs). The extent of
the framework region and CDRs has been precisely defined
(see, "Sequences of Proteins of Immunological Interest",
Kabat, E. et al., U.S. Department of Health and Human
Services (1983)). Briefly, humanized antibodies are
antibody molecules from non-human species having one or more
CDRs from the non-human species and a framework region from a
human immunoglobulin molecule.

Alternatively, techniques described for the production
of single chain antibodies (U.S. Patent 4,946,778; Bird,
1988, Science 242:423-426; Huston, et al., 1988, Proc. Natl.
Acad. Sci. USA 85:5879-5883; and Ward, et al., 1989, Nature
334:544-546) can be adapted to produce single chain
antibodies against HKNG1 gene products. Single chain
antibodies are formed by linking the heavy and light chain
fragments of the Fv region via an amino acid bridge,
resulting in a single chain polypeptide.

Antibody fragments that recognize specific epitopes may
be generated by known techniques. For example, such
fragments include but are not limited to: the F(ab')2
fragments, which can be produced by pepsin digestion of the
antibody molecule, and the Fab fragments, which can be
generated by reducing the disulfide bridges of the F(ab')2
fragments. Alternatively, Fab expression libraries may be
constructed (Huse, et al., 1989, Science 246:1275-1281) to
allow rapid and easy identification of monoclonal Fab
fragments with the desired specificity.

5.4. USES OF HKNG1 GENE SEQUENCES,
GENE PRODUCTS, AND ANTIBODIES

Described herein are various applications of HKNG1 gene
sequences, HKNG1 gene products, including peptide fragments
and fusion proteins thereof, and of antibodies directed
against HKNG1 gene products and peptide fragments thereof.
Such applications include, for example, mapping of chromosome
18p, prognostic and diagnostic evaluation of HKNG1-mediated
disorders, including CNS-related disorders, e.g.,
neuropsychiatric disorders, such as BAD or schizophrenia,
modulation of HKNG1-related processes, and the identification
of subjects with a predisposition to such disorders, as
described, below, in Section 5.5.

Additionally, such applications include methods for the
treatment of HKNG1-mediated disorders, such as BAD or
schizophrenia, as described, below, in Section 5.9, and for
the identification of compounds that modulate the expression
of the HKNG1 gene and/or the synthesis or activity of the
HKNG1 gene product, as described, below, in Section 5.8. Such
compounds can include, for example, other cellular products
that are involved in such processes as mood regulation and in
HKNG1-mediated disorders, e.g., neuropsychiatric disorders
such as BAD or schizophrenia. These compounds can be used,
for example, in the amelioration of HKNG1-mediated disorders
and for the modulation of HKNG1-mediated processes.

Uses of the HKNG1 gene sequences, HKNG1 gene products,
including peptide fragments and fusion proteins thereof, and
of antibodies directed against HKNG1 gene products and/or
peptide fragments thereof also include prognostic and
diagnostic evaluation of a HKNG1-mediated myopia disorder
such as early-onset autosomal dominant myopia, methods for
the treatment of a HKNG1-mediated myopia disorder, and for
the identification of compounds that modulate the expression
of the HKNG1 gene and/or the synthesis or activity of the
HKNG1 gene product and could therefore be used in the
amelioration of a HKNG1-mediated myopia such as early-onset
autosomal dominant myopia. Indeed, such methods are
substantially identical to the methods described, below, in
Sections 5.5, 5.8, and 5.9 for the diagnosis and treatment of
HKNG1-mediated disorders.

5.5. DIAGNOSIS OF HKNG1-MEDIATED
DISORDERS

A variety of methods can be employed for the diagnostic
and prognostic evaluation of HKNG1-mediated disorders, e.g.,
neuropsychiatric disorders, and for the identification of
subjects having a predisposition to such disorders.

25 

Such methods may, for example, utilize reagents such as 
the HKHG1 gene nucleotide sequences described in Sections 

5.1, and antibodies directed against HKNG1 gene products , 
including peptide fragments thereof, as described, above, in 
Section 5.3. Specifically/ such reagents may be used, for 
30 example, for: 
(1) the detection of the presence of HKNG1 gene
mutations, or the detection of either over- or
under-expression of the HKNG1 gene relative to wild-type
HKNG1 levels of expression;

(2) the detection of over- or under-abundance of HKNG1
gene product relative to wild-type abundance of HKNG1 gene
product; and

(3) the detection of an aberrant level of HKNG1 gene
product activity relative to wild-type HKNG1 gene product
activity levels.
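
Items (1) to (3) above share one pattern: a measured level is compared against a wild-type reference. The following Python fragment is a minimal sketch of that comparison. The function name `classify_level` and the 1.5-fold cutoff are hypothetical choices made for illustration only; no numerical threshold is stated in this section.

```python
def classify_level(sample: float, wild_type: float, fold: float = 1.5) -> str:
    """Classify a measured level relative to a wild-type reference.

    `fold` is a hypothetical cutoff: a sample/wild-type ratio >= fold is
    called "over", a ratio <= 1/fold is called "under", otherwise "normal".
    """
    if sample < 0 or wild_type <= 0 or fold <= 1:
        raise ValueError("need sample >= 0, wild_type > 0 and fold > 1")
    ratio = sample / wild_type
    if ratio >= fold:
        return "over"
    if ratio <= 1.0 / fold:
        return "under"
    return "normal"
```

The same comparison could be applied to mRNA abundance, gene product abundance, or gene product activity, provided sample and reference are measured in the same units.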
HKNG1 gene nucleotide sequences can, for example, be
used to diagnose a HKNG1-mediated neuropsychiatric disorder
using, for example, the techniques for HKNG1
mutation/polymorphism detection described above in Section
5.1, and in Section 5.6 below.

Mutations at a number of different genetic loci may lead
to phenotypes related to neuropsychiatric disorders.
Ideally, the treatment of patients suffering from such a
neuropsychiatric disorder will be designed to target the
particular genetic loci containing the mutation mediating the
disorder. Genetic polymorphisms have been linked to
differences in drug effectiveness. Thus, identification of
alterations in the HKNG1 gene, protein or gene flanking
regions can be utilized in pharmacogenetic methods to
optimize therapeutic drug treatments.

In one embodiment of the present invention, therefore,
alterations, i.e., polymorphisms, in the HKNG1 gene or
protein encoded by genes comprising such polymorphisms, are
associated with a drug or drugs' efficacy, tolerance, or
toxicity, and may be used in pharmacogenomic methods to
optimize therapeutic drug treatments, including therapeutic
drug treatments for one of the disorders described herein,
e.g., HKNG1-mediated disorders such as schizophrenia and BAD.
Such polymorphisms can be used, for example, to refine the
design of drugs by decreasing the incidence of adverse events
in drug tolerance studies, e.g., by identifying patient
subpopulations of individuals who respond or do not respond
to a particular drug therapy in efficacy studies, wherein the
subpopulations have a HKNG1 polymorphism associated with drug
responsiveness or unresponsiveness. The pharmacogenomic
methods of the present invention can also provide tools to
identify new drug targets for designing drugs and to optimize
the use of already existing drugs, e.g., to increase the
response rate to a drug and/or to identify and exclude
non-responders from certain drug treatments (e.g., individuals
having a particular HKNG1 polymorphism associated with
unresponsiveness or inferior responsiveness to the drug
treatment) or to decrease the undesirable side effects of
certain drug treatments and/or to identify and exclude
individuals with marked susceptibility to such side effects
(e.g., individuals having a particular HKNG1 polymorphism
associated with an undesirable side effect to the drug
treatment).

In an embodiment of the present invention, polymorphisms
in the HKNG1 gene sequence or flanking this sequence, or
variations in HKNG1 gene expression or activity, e.g.,
variations due to altered methylation, differential splicing,
or post-translational modification of the HKNG1 gene product,
may be utilized to identify an individual having a disease or
condition resulting from a HKNG1-mediated disorder and thus
define the most effective and safest drug treatment. Assays
such as those described herein may be used to identify such
polymorphisms or variations in HKNG1 gene expression or
activity. Once a polymorphism in the HKNG1 gene or in a
flanking sequence in linkage disequilibrium with a
disorder-causing allele, or a variation in HKNG1 gene
expression has been identified in an individual, an
appropriate drug treatment can be prescribed to the individual.
For the detection of HKNG1 gene mutations or
polymorphisms, any nucleated cell can be used as a starting
source for genomic nucleic acid. For the detection of HKNG1
gene expression or HKNG1 gene products, any cell type or
tissue in which the HKNG1 gene is expressed may be utilized.

Nucleic acid-based detection techniques are described,
below, in Section 5.6. Peptide detection techniques are
described, below, in Section 5.7.

The methods described herein may be performed, for
example, by utilizing pre-packaged diagnostic kits. The
invention therefore also encompasses kits for detecting the
presence of a polypeptide or nucleic acid of the invention in
a biological sample (i.e., a test sample). Such kits can be
used, e.g., to determine if a subject is suffering from or is
at increased risk of developing a disorder associated with a
disorder-causing allele, or aberrant expression or activity
of a polypeptide of the invention (e.g., a CNS disorder,
including a neuropsychiatric disorder such as BAD or
schizophrenia). For example, the kit can comprise a labeled
compound or agent capable of detecting the polypeptide or
mRNA or DNA or HKNG1 gene sequences, e.g., encoding the
polypeptide in a biological sample. The kit can further
comprise a means for determining the amount of the
polypeptide or mRNA in the sample (e.g., an antibody which
binds the polypeptide or an oligonucleotide probe which binds
to DNA or mRNA encoding the polypeptide). Kits can also
include instructions for observing that the tested subject is
suffering from or is at risk of developing a disorder
associated with aberrant expression of the polypeptide if the
amount of the polypeptide or mRNA encoding the polypeptide is
above or below a normal level, or if the DNA correlates with
presence of a HKNG1 allele that causes a disorder.

For antibody-based kits, the kit can comprise, for
example: (1) a first antibody (e.g., attached to a solid
support) which binds to a polypeptide of the invention; and,
optionally, (2) a second, different antibody which binds to
either the polypeptide or to the first antibody and is
conjugated to a detectable agent.

For oligonucleotide-based kits, the kit can comprise,
for example: (1) an oligonucleotide (e.g., a detectably
labeled oligonucleotide) which hybridizes to a nucleic acid
sequence encoding a polypeptide of the invention, or (2) a
pair of primers, such as the primers recited in Table 1,
useful for amplifying a nucleic acid molecule encoding a
polypeptide of the invention.

The kit can also comprise, for example, one or more
buffering agents, preservatives, or protein stabilizing
agents. The kit can also comprise components necessary for
detecting the detectable agent (e.g., an enzyme or a
substrate). The kit can also contain a control sample or a
series of control samples which can be assayed and compared
to the test sample. Each component of the kit is usually
enclosed within an individual container and all of the
various containers are within a single package along with
instructions for observing whether the tested subject is
suffering from or is at risk of developing a disorder
associated with polymorphisms that correlate with alleles
that cause HKNG1-related disorders, and/or aberrant levels of
HKNG1 mRNA, polypeptides or activity.

5.6. DETECTION OF HKNG1 NUCLEIC ACID MOLECULES
A variety of methods can be employed to screen for the
presence of HKNG1 gene-specific mutations or polymorphisms
(including polymorphisms flanking the HKNG1 gene) and to
detect and/or assay levels of HKNG1 nucleic acid sequences.

Mutations or polymorphisms within or flanking the HKNG1
gene can be detected by utilizing a number of techniques.
Nucleic acid from any nucleated cell can be used as the
starting point for such assay techniques, and may be isolated
according to standard nucleic acid preparation procedures
that are well known to those of skill in the art.

HKNG1 nucleic acid sequences may be used in
hybridization or amplification assays of biological samples
to detect abnormalities involving HKNG1 gene structure,
including point mutations, insertions, deletions, inversions,
translocations and chromosomal rearrangements. Such assays
may include, but are not limited to, Southern analyses,
single-stranded conformational polymorphism analyses (SSCP),
and PCR analyses.

Diagnostic methods for the detection of HKNG1
gene-specific mutations or polymorphisms can involve, for
example, contacting and incubating nucleic acids obtained from
a sample, e.g., derived from a patient sample or other
appropriate cellular source, with one or more labeled nucleic
acid reagents including recombinant DNA molecules, cloned
genes or degenerate variants thereof, such as described in
Section 5.1, above, under conditions favorable for the
specific annealing of these reagents to their complementary
sequences within or flanking the HKNG1 gene. The diagnostic
methods of the present invention further encompass contacting
and incubating nucleic acids for the detection of single
nucleotide mutations or polymorphisms of the HKNG1 gene.
Preferably, these nucleic acid reagent sequences within the
HKNG1 gene, or chromosome 18p nucleotide sequences flanking
the HKNG1 gene, are 15 to 30 nucleotides in length.

After incubation, all non-annealed nucleic acids are
removed from the nucleic acid:HKNG1 molecule hybrid. The
presence of nucleic acids that have hybridized, if any such
molecules exist, is then detected. Using such a detection
scheme, the nucleic acid from the cell type or tissue of
interest can be immobilized, for example, to a solid support
such as a membrane, or a plastic surface such as that on a
microtiter plate or polystyrene beads. In this case, after
incubation, non-annealed, labeled nucleic acid reagents of
the type described in Section 5.1 are easily removed.
Detection of the remaining, annealed, labeled HKNG1 nucleic
acid reagents is accomplished using standard techniques
well-known to those in the art. The HKNG1 gene sequences to
which the nucleic acid reagents have annealed can be compared
to the annealing pattern expected from a normal HKNG1 gene
sequence in order to determine whether a HKNG1 gene mutation
is present.

In a preferred embodiment, HKNG1 mutations or
polymorphisms can be detected by using a microassay of HKNG1
nucleic acid sequences immobilized to a substrate or "gene
chip" (see, e.g., Cronin, et al., 1996, Human Mutation
7:244-255).

Alternative diagnostic methods for the detection of
HKNG1 gene-specific nucleic acid molecules (or HKNG1 flanking
sequences), in patient samples or other appropriate cell
sources, may involve their amplification, e.g., by PCR (the
experimental embodiment set forth in Mullis, 1987, U.S.
Patent No. 4,683,202), followed by the analysis of the
amplified molecules using techniques well known to those of
skill in the art, such as, for example, those listed above.
The resulting amplified sequences can be compared to those
that would be expected if the nucleic acid being amplified
contained only normal copies of the HKNG1 gene in order to
determine whether a HKNG1 gene mutation or polymorphism in
linkage disequilibrium with a disease-causing HKNG1 allele
exists.
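
The comparison of an amplified sequence with the sequence expected from a normal gene copy can be illustrated with a short Python sketch. It assumes the two sequences are already aligned and of equal length; the function name `find_mismatches` and the toy sequences in the usage note are hypothetical and are not HKNG1 sequences.

```python
def find_mismatches(reference: str, amplified: str) -> list[tuple[int, str, str]]:
    """Return (1-based position, reference base, observed base) for every
    position at which two equal-length sequences differ."""
    ref, obs = reference.upper(), amplified.upper()
    if len(ref) != len(obs):
        raise ValueError("sequences differ in length; align them first")
    return [(i + 1, r, o) for i, (r, o) in enumerate(zip(ref, obs)) if r != o]
```

For example, comparing the toy reference "ACGTGAAC" with the toy read "ACGTAAAC" reports a single G-to-A difference at position 5. Insertions and deletions would require an alignment step that this sketch does not attempt.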

Among those HKNG1 nucleic acid sequences which are
preferred for such amplification-related diagnostic screening
analyses are oligonucleotide primers which amplify HKNG1 exon
sequences. The sequences of such oligonucleotide primers
are, therefore, preferably derived from HKNG1 intron
sequences so that the entire exon, or coding region, can be
analyzed as discussed below. Primer pairs useful for
amplification of HKNG1 exons are preferably derived from
adjacent introns. Appropriate primer pairs can be chosen
such that each of the eleven HKNG1 exons is amplified.
Primers for the amplification of HKNG1 exons can be routinely
designed by one of ordinary skill in the art by utilizing the
exon and intron sequences of HKNG1 shown in Figure 3A-3R.

As an example, and not by way of limitation, Table 1,
below, lists primers and primer pairs which can be utilized
for the amplification of each of the human HKNG1 exons one
through eleven. In this table, a primer pair is listed for
each exon which consists of a forward primer derived from
intron sequence upstream of the exon to be amplified, and a
reverse primer derived from intron sequence downstream of the
exon to be amplified. For exons greater than about 300 base
pairs in length, i.e., exons 4 and 7, two primer pairs are
listed (marked 4A, 4B, 7A and 7B). Each of the primer pairs
can be utilized, therefore, as part of a standard PCR
reaction to amplify an individual HKNG1 exon (or portion
thereof). Primer sequences are depicted in a 5' to 3'
orientation.



TABLE 1

Exon  Primer Sequence              SEQ ID NO          Orientation

1     cggggttggtttccacc            (SEQ ID NO: 8)     forward
      gcgaggagagaaatctggg          (SEQ ID NO: 9)     reverse
2     tgctcactactttgcag±gttc       (SEQ ID NO: 10)    forward
      tgagatcgtgtcactgcattct       (SEQ ID NO: 11)    reverse
3     gtaaatctcaaaatgttgggttaatag  (SEQ ID NO: 12)    forward
      ctaactcttcttctatcattactc     (SEQ ID NO: 13)    reverse
4A    tgtttattgtgtgtctgctgtg       (SEQ ID NO: 14)    forward
      ggacaaccaaca-bgcaaacag       (SEQ ID NO: 15)    reverse
4B    cccaggtgirfrttcaatitgatgc    (SEQ ID NO: 16)    forward
      agcagttttgtccttccaagtg       (SEQ ID NO: 17)    reverse
5     gtgttttgtaatctgatcagatctc    (SEQ ID NO: 18)    forward
      gcagta-b-fctctggtccagatc     (SEQ ID NO: 19)    reverse
6     ggtgcacatagatcatgaaatgg      (SEQ ID NO: 20)    forward
      taagcrtgaaa'taggtgccrt-taag  (SEQ ID NO: 21)    reverse
7A    tttatfcccattstctigtcccctac   (SEQ ID NO: 22)    forward
      aaggc-fccagttaggtctgtatc     (SEQ ID NO: 23)    reverse
7B    caggagttttaacgtcttcagac      (SEQ ID NO: 24)    forward
      gactcagaaatgfcctaccatttc     (SEQ ID NO: 25)    reverse
8     tgbctccacttct-fccaaagtgc     (SEQ ID NO: 26)    forward
      caaaatgtacctgagaacttaaag     (SEQ ID NO: 27)    reverse
9     cacctccaagtttcatggac         (SEQ ID NO: 28)    forward
      caaggtatgcacgtgtcatttc       (SEQ ID NO: 29)    reverse
10    gaatgtgtattgggatttagtaaac    (SEQ ID NO: 30)    forward
      ttgagaattaactattcctgtcaac    (SEQ ID NO: 31)    reverse
11    ccatcctggacttttactcc         (SEQ ID NO: 32)    forward
      ctttcctgcaactgtgtttattg      (SEQ ID NO: 33)    reverse

Each primer pair above can be used to generate an 
amplified sequence of about 300 base pairs. This is 
especially desirable in instances in which sequence analysis 
is performed using SSCP gel electrophoretic procedures, in 
that such procedures work optimally using sequences of about 
300 base pairs or less.
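
The roughly 300 base pair limit explains why the longer exons are covered by two primer pairs. As a rough illustration only, the Python sketch below splits a region into the fewest contiguous segments that each stay within a maximum amplicon size. The function name `plan_amplicons` is hypothetical, and the sketch ignores primer binding sites, flanking intron sequence, and any overlap between neighbouring amplicons, all of which a real primer design would have to take into account.

```python
import math

def plan_amplicons(region_length: int, max_amplicon: int = 300) -> list[tuple[int, int]]:
    """Split a region of `region_length` bp into the fewest contiguous
    segments of at most `max_amplicon` bp each, returned as 1-based
    inclusive (start, end) coordinates of near-equal size."""
    if region_length <= 0 or max_amplicon <= 0:
        raise ValueError("lengths must be positive")
    n = math.ceil(region_length / max_amplicon)
    base, extra = divmod(region_length, n)
    segments, start = [], 1
    for i in range(n):
        size = base + (1 if i < extra else 0)  # spread the remainder evenly
        segments.append((start, start + size - 1))
        start += size
    return segments
```

Under these assumptions a 250 bp region needs one segment, while a 450 bp region is split into two segments of 225 bp each.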

Additional HKNG1 nucleic acid sequences which are
preferred for such amplification-related analyses are those
which will detect the presence of an HKNG1 polymorphism which
differs from the HKNG1 sequence depicted in FIG. 3A-3R. Such
polymorphisms include ones which represent mutations
associated with an HKNG1-mediated neuropsychiatric disorder,
such as BAD or schizophrenia. For example, a single base
mutation identified in the Example presented in Section 8,
below, results in a mutant HKNG1 gene product comprising
substitution of a lysine residue for the wild-type glutamic
acid residue at amino acid position 202 of the HKNG1 amino
acid sequence shown in FIG. 1A-1B (SEQ ID NO: 2) or amino acid
position 184 of the HKNG1 amino acid sequence shown in FIG.
2A-2B (SEQ ID NO: 4). Such polymorphisms also include ones
that correlate with the presence of a HKNG1-mediated
neuropsychiatric disorder, e.g., polymorphisms that are in
linkage disequilibrium with disorder-causing HKNG1 alleles.
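
To illustrate how a single base change can replace glutamic acid with lysine, the Python sketch below enumerates, under the standard genetic code, every codon pair that differs at exactly one base and converts one amino acid into another. This section does not state which codon or base is altered in HKNG1, so the code shows only what is possible in general; the names `CODON_TABLE` and `single_base_paths` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order ('*' marks stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def single_base_paths(from_aa: str, to_aa: str) -> list[tuple[str, str]]:
    """List (codon, mutant codon) pairs that differ at exactly one base and
    change `from_aa` into `to_aa` (one-letter amino acid codes)."""
    paths = []
    for codon, aa in CODON_TABLE.items():
        if aa != from_aa:
            continue
        for pos in range(3):
            for base in BASES:
                if base == codon[pos]:
                    continue
                mutant = codon[:pos] + base + codon[pos + 1:]
                if CODON_TABLE[mutant] == to_aa:
                    paths.append((codon, mutant))
    return sorted(paths)
```

Calling `single_base_paths("E", "K")` yields GAA to AAA and GAG to AAG, i.e. in both cases a G-to-A change at the first codon position.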
Amplification techniques are well known to those of
skill in the art and can routinely be utilized in connection
with primers such as those listed in Table 1 above. In
general, hybridization conditions can be as follows.
In general, for probes between 14 and 70 nucleotides in
length the melting temperature Tm is calculated using the
formula:

    Tm(°C) = 81.5 + 16.6(log[monovalent cations]) + 0.41(% G+C) - (500/N)

where N is the length of the probe. If the
hybridization is carried out in a solution containing
formamide, the melting temperature is calculated using the
equation:

    Tm(°C) = 81.5 + 16.6(log[monovalent cations]) + 0.41(% G+C)
             - (0.61% formamide) - (500/N)

where N is the length of the probe.
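
The two melting temperature formulas above can be evaluated with a small Python function. This is a sketch under two assumptions that the text does not spell out: that "log" means the base-10 logarithm and that the monovalent cation concentration is given in mol/L. The function name `melting_temperature` and its parameter names are hypothetical.

```python
import math

def melting_temperature(monovalent_molar: float, gc_percent: float,
                        length: int, formamide_percent: float = 0.0) -> float:
    """Tm in deg C = 81.5 + 16.6*log10[M+] + 0.41*(%G+C)
                     - 0.61*(%formamide) - 500/N.

    Assumes log base 10 and [M+] in mol/L; with formamide_percent = 0
    this reduces to the formula without formamide.
    """
    if monovalent_molar <= 0 or length <= 0:
        raise ValueError("concentration and probe length must be positive")
    return (81.5
            + 16.6 * math.log10(monovalent_molar)
            + 0.41 * gc_percent
            - 0.61 * formamide_percent
            - 500.0 / length)
```

For a 20-nucleotide probe with 50% G+C in 1 M monovalent cations, the terms are 81.5 + 0 + 20.5 - 25, giving 77 °C; adding 50% formamide subtracts a further 30.5 °C.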

Additionally, well-known genotyping techniques can be
performed to identify individuals carrying HKNG1 gene
mutations. Such techniques include, for example, the use of
restriction fragment length polymorphisms (RFLPs), which
involve sequence variations in one of the recognition sites
for the specific restriction enzyme used.
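
The effect of a sequence variation within a recognition site can be sketched in Python by computing the fragment lengths produced when a linear sequence is cut at every occurrence of a site. The function name `digest_fragment_lengths` and the toy sequences are hypothetical; the EcoRI site GAATTC, cut after the first G, is used only as a familiar example, and the sketch scans a single strand of a linear molecule.

```python
def digest_fragment_lengths(sequence: str, site: str, cut_offset: int) -> list[int]:
    """Lengths of the fragments obtained by cutting a linear sequence at
    every occurrence of `site`; the cut falls `cut_offset` bases into the
    site (e.g. 1 for EcoRI, G^AATTC)."""
    seq, site = sequence.upper(), site.upper()
    if not site or not 0 <= cut_offset <= len(site):
        raise ValueError("invalid site or cut offset")
    cuts, start = [], 0
    while (hit := seq.find(site, start)) != -1:
        cuts.append(hit + cut_offset)
        start = hit + 1
    bounds = [0] + cuts + [len(seq)]
    return [b - a for a, b in zip(bounds, bounds[1:]) if b > a]
```

With the toy sequence "AAAGAATTCTTTT" the site is present and two fragments of 4 and 9 bases result; in the variant "AAAGAGTTCTTTT" the site is lost and a single 13-base fragment remains, which is the kind of length difference an RFLP analysis detects.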

Further, improved methods for analyzing DNA
polymorphisms, which can be utilized for the identification
of HKNG1 gene-specific mutations, have been described that
capitalize on the presence of variable numbers of short,
tandemly repeated DNA sequences between the restriction
enzyme sites. For example, Weber (U.S. Pat. No. 5,075,217)
describes a DNA marker based on length polymorphisms in
blocks of (dC-dA)n-(dG-dT)n short tandem repeats. The
average separation of (dC-dA)n-(dG-dT)n blocks is estimated
to be 30,000-60,000 bp. Markers that are so closely spaced
exhibit a high frequency of co-inheritance, and are extremely
useful in the identification of genetic mutations, such as,
for example, mutations within the HKNG1 gene, and the
diagnosis of diseases and disorders related to HKNG1
mutations.
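
Length polymorphism in a (dC-dA)n block amounts to a difference in the number of CA repeat units between alleles. The Python sketch below counts the units in the longest uninterrupted CA run on the strand supplied; the function name `longest_ca_repeat` and the toy alleles in the usage note are hypothetical, and the complementary GT strand is not examined.

```python
import re

def longest_ca_repeat(sequence: str) -> int:
    """Number of repeat units in the longest uninterrupted (CA)n run of the
    given strand; returns 0 when no CA dinucleotide is present."""
    runs = re.findall(r"(?:CA)+", sequence.upper())
    return max((len(run) // 2 for run in runs), default=0)
```

Two toy alleles such as "GGT" + "CA" * 4 + "TT" and "GGT" + "CA" * 6 + "TT" give counts of 4 and 6, so their amplified products would differ in length by four bases.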

Also, Caskey et al. (U.S. Pat. No. 5,364,759) describe a
DNA profiling assay for detecting short tri- and
tetranucleotide repeat sequences. The process includes
extracting the DNA of interest, such as the HKNG1 gene,
amplifying the extracted DNA, and labelling the repeat
sequences to form a genotypic map of the individual's DNA.

Other methods well known in the art may be used to
identify single nucleotide polymorphisms (SNPs), including
biallelic SNPs or biallelic markers which have two alleles,
both of which are present at a fairly high frequency in a
population. Conventional techniques for detecting SNPs
include, e.g., conventional dot blot analysis, single
stranded conformational polymorphism (SSCP) analysis (see,
e.g., Orita et al., 1989, Proc. Natl. Acad. Sci. USA
86:2766-2770), denaturing gradient gel electrophoresis (DGGE),
heteroduplex analysis, mismatch cleavage detection, and other
routine techniques well known in the art (see, e.g.,
Sheffield et al., 1989, Proc. Natl. Acad. Sci. 86:5855-5892;
Grompe, 1993, Nature Genetics 5:111-117). Alternative,
preferred methods of detecting and mapping SNPs involve
microsequencing techniques wherein an SNP site in a target
DNA is detected by a single nucleotide primer extension
reaction (see, e.g., Goelet et al., PCT Publication No.
WO92/15712; Mundy, U.S. Patent No. 4,656,127; Vary and
Diamond, U.S. Patent No. 4,851,331; Cohen et al., PCT
Publication No. WO91/02087; Chee et al., PCT Publication No.
WO95/11995; Landegren et al., 1988, Science 241:1077-1080;
Nickerson et al., 1990, Proc. Natl. Acad. Sci. U.S.A.
87:8923-8927; Pastinen et al., 1997, Genome Res. 7:606-614;
Pastinen et al., 1996, Clin. Chem. 42:1391-1397; Jalanko et
al., 1992, Clin. Chem. 38:39-43; Shumaker et al., 1996, Hum.
Mutation 7:346-354; Caskey et al., PCT Publication No. WO
95/00669).

The level of HKNG1 gene expression can also be assayed.
For example, RNA from a cell type or tissue known, or
suspected, to express the HKNG1 gene, such as brain, may be
isolated and tested utilizing hybridization or PCR techniques
such as are described, above. The isolated cells can be
derived from cell culture or from a patient. The analysis of
cells taken from culture may be a necessary step in the
assessment of cells to be used as part of a cell-based gene
therapy technique or, alternatively, to test the effect of
compounds on the expression of the HKNG1 gene. Such analyses
may reveal both quantitative and qualitative aspects of the
expression pattern of the HKNG1 gene, including activation or
inactivation of HKNG1 gene expression.

In one embodiment of such a detection scheme, a cDNA
molecule is synthesized from an RNA molecule of interest
(e.g., by reverse transcription of the RNA molecule into
cDNA). A sequence within the cDNA is then used as the
template for a nucleic acid amplification reaction, such as a
PCR amplification reaction, or the like. The nucleic acid
reagents used as synthesis initiation reagents (e.g.,
primers) in the reverse transcription and nucleic acid
amplification steps of this method are chosen from among the
HKNG1 gene nucleic acid reagents described in Section 5.1.
The preferred lengths of such nucleic acid reagents are at
least 9-30 nucleotides. For detection of the amplified
product, the nucleic acid amplification may be performed
using radioactively or non-radioactively labeled nucleotides.
Alternatively, enough amplified product may be made such that
the product may be visualized by standard ethidium bromide
staining or by utilizing any other suitable nucleic acid
staining method.

Additionally, it is possible to perform such HKNG1 gene
expression assays "in situ", i.e., directly upon tissue
sections (fixed and/or frozen) of patient tissue obtained
from biopsies or resections, such that no nucleic acid
purification is necessary. Nucleic acid reagents such as
those described in Section 5.1 may be used as probes and/or
primers for such in situ procedures (see, for example, Nuovo,
G.J., 1992, "PCR in Situ Hybridization: Protocols And
Applications", Raven Press, NY).

Alternatively, if a sufficient quantity of the 
appropriate cells can be obtained, standard Northern analysis 
can be performed to determine the level of mRNA expression of 
the HKNG1 gene.

5.7. DETECTION OF HKNG1 GENE PRODUCTS
Antibodies directed against unimpaired or mutant HKNG1
gene products or conserved variants or peptide fragments
thereof, which are discussed, above, in Section 5.3, may also
be used as diagnostics and prognostics for a HKNG1-mediated
disorder, e.g., a neuropsychiatric disorder such as BAD or
schizophrenia. Such methods may be used to detect
abnormalities in the level of HKNG1 gene product synthesis or
expression, or abnormalities in the structure, temporal
expression, and/or physical location of HKNG1 gene product.
The antibodies and immunoassay methods described herein have,
for example, important in vitro applications in assessing the
efficacy of treatments for HKNG1-mediated disorders.
Antibodies, or fragments of antibodies, such as those
described below, may be used to screen potentially
therapeutic compounds in vitro to determine their effects on
HKNG1 gene expression and HKNG1 gene product production.
Compounds that have beneficial effects on a HKNG1-mediated
disorder, such as BAD or schizophrenia, can thereby be
identified.

In vitro immunoassays may also be used, for example, to
assess the efficacy of cell-based gene therapy for a
HKNG1-mediated disorder, e.g., a neuropsychiatric disorder,
such as BAD or schizophrenia. Antibodies directed against
HKNG1 gene products may be used in vitro to determine, for
example, the level of HKNG1 gene expression achieved in cells
genetically engineered to produce HKNG1 gene product. In the
case of intracellular HKNG1 gene products, such an assessment
is done, preferably, using cell lysates or extracts. Such
analysis will allow for a determination of the number of
transformed cells necessary to achieve therapeutic efficacy
in vivo, as well as optimization of the gene replacement
protocol.

The tissue or cell type to be analyzed will generally
include those that are known, or suspected, to express the
HKNG1 gene. The protein isolation methods employed herein
may, for example, be such as those described in Harlow and
Lane (1988, "Antibodies: A Laboratory Manual", Cold Spring
Harbor Laboratory Press, Cold Spring Harbor, New York). The
isolated cells can be derived from cell culture or from a
patient. The analysis of cells taken from culture may be a
necessary step in the assessment of cells to be used as part
of a cell-based gene therapy technique or, alternatively, to
test the effect of compounds on the expression of the HKNG1
gene.

Preferred diagnostic methods for the detection of HKNG1
gene products, conserved variants or peptide fragments
thereof, may involve, for example, immunoassays wherein the
HKNG1 gene products or conserved variants or peptide
fragments are detected by their interaction with an
anti-HKNG1 gene product-specific antibody.

For example, antibodies, or fragments of antibodies,
such as those described, above, in Section 5.3, may be used
to quantitatively or qualitatively detect the presence of
HKNG1 gene products or conserved variants or peptide
fragments thereof. This can be accomplished, for example, by
immunofluorescence techniques employing a fluorescently
labeled antibody (see below, this Section) coupled with light
microscopic, flow cytometric, or fluorimetric detection.
Such techniques are especially preferred for HKNG1 gene
products that are expressed on the cell surface.
The antibodies (or fragments thereof) useful in the
present invention may, additionally, be employed
histologically, as in immunofluorescence or immunoelectron
microscopy, for in situ detection of HKNG1 gene products,
conserved variants or peptide fragments thereof. In situ
detection may be accomplished by removing a histological
specimen from a patient, and applying thereto a labeled
antibody that binds to a HKNG1 polypeptide. The antibody (or
fragment) is preferably applied by overlaying the labeled
antibody (or fragment) onto a biological sample. Through the
use of such a procedure, it is possible to determine not only
the presence of the HKNG1 gene product, conserved variants or
peptide fragments, but also its distribution in the examined
tissue. Using the present invention, those of ordinary skill
will readily recognize that any of a wide variety of
histological methods (such as staining procedures) can be
modified in order to achieve in situ detection of a HKNG1
gene product.

Immunoassays for HKNG1 gene products, conserved
variants, or peptide fragments thereof will typically
comprise incubating a sample, such as a biological fluid, a
tissue extract, freshly harvested cells, or lysates of cells
in the presence of a detectably labeled antibody capable of
identifying HKNG1 gene product, conserved variants or peptide
fragments thereof, and detecting the bound antibody by any of
a number of techniques well-known in the art.

The biological sample may be brought in contact with and
immobilized onto a solid phase support or carrier, such as
nitrocellulose, that is capable of immobilizing cells, cell
particles or soluble proteins. The support may then be
washed with suitable buffers followed by treatment with the
detectably labeled HKNG1 gene product specific antibody. The
solid phase support may then be washed with the buffer a
second time to remove unbound antibody. The amount of bound
label on the solid support may then be detected by
conventional means.

By "solid phase support or carrier" is intended any
support capable of binding an antigen or an antibody.
Well-known supports or carriers include glass, polystyrene,
polypropylene, polyethylene, dextran, nylon, amylases,
natural and modified celluloses, polyacrylamides, gabbros,
and magnetite. The nature of the carrier can be either
soluble to some extent or insoluble for the purposes of the
present invention. The support material may have virtually
any possible structural configuration so long as the coupled
molecule is capable of binding to an antigen or antibody.
Thus, the support configuration may be spherical, as in a
bead, or cylindrical, as in the inside surface of a test
tube, or the external surface of a rod. Alternatively, the
surface may be flat such as a sheet, test strip, etc.
Preferred supports include polystyrene beads. Those skilled
in the art will know many other suitable carriers for binding
antibody or antigen, or will be able to ascertain the same by
use of routine experimentation.

One of the ways in which the HKNG1 gene product-specific
antibody can be detectably labeled is by linking the same to
an enzyme, such as for use in an enzyme immunoassay (EIA)
(Voller, A., "The Enzyme Linked Immunosorbent Assay (ELISA)",
1978, Diagnostic Horizons 2:1-7, Microbiological Associates
Quarterly Publication, Walkersville, MD; Voller, A. et al.,
1978, J. Clin. Pathol. 31:507-520; Butler, J.E., 1981, Meth.
Enzymol. 73:482-523; Maggio, E. (ed.), 1980, Enzyme
Immunoassay, CRC Press, Boca Raton, FL; Ishikawa, E. et al.
(eds.), 1981, Enzyme Immunoassay, Kgaku Shoin, Tokyo). The
enzyme which is bound to the antibody will react with an
appropriate substrate, preferably a chromogenic substrate, in
such a manner as to produce a chemical moiety that can be
detected, for example, by spectrophotometric, fluorimetric or
by visual means. Enzymes that can be used to detectably
label the antibody include, but are not limited to, malate
dehydrogenase, staphylococcal nuclease, delta-5-steroid
isomerase, yeast alcohol dehydrogenase, α-glycerophosphate
dehydrogenase, triose phosphate isomerase, horseradish
peroxidase, alkaline phosphatase, asparaginase, glucose
oxidase, β-galactosidase, ribonuclease, urease, catalase,
glucose-6-phosphate dehydrogenase, glucoamylase and
acetylcholinesterase. The detection can be accomplished by
colorimetric methods that employ a chromogenic substrate for
the enzyme. Detection may also be accomplished by visual
comparison of the extent of enzymatic reaction of a substrate
in comparison with similarly prepared standards.
Detection may also be accomplished using any of a
variety of other immunoassays. For example, by radioactively
labeling the antibodies or antibody fragments, it is possible
to detect HKNG1 gene products through the use of a
radioimmunoassay (RIA) (see, for example, Weintraub, B.,
Principles of Radioimmunoassays, Seventh Training Course on
Radioligand Assay Techniques, The Endocrine Society, March,
1986). The radioactive isotope can be detected by such means
as the use of a gamma counter or a scintillation counter or
by autoradiography.

It is also possible to label the antibody with a
fluorescent compound. When the fluorescently labeled
antibody is exposed to light of the proper wavelength, its
presence can then be detected due to fluorescence. Among the
most commonly used fluorescent labeling compounds are
fluorescein isothiocyanate, rhodamine, phycoerythrin,
phycocyanin, allophycocyanin, o-phthaldehyde and
fluorescamine.

The antibody can also be detectably labeled using
fluorescence emitting metals such as 152Eu, or others of the
lanthanide series. These metals can be attached to the
antibody using such metal chelating groups as
diethylenetriaminepentacetic acid (DTPA) or
ethylenediaminetetraacetic acid (EDTA).

The antibody also can be detectably labeled by coupling
it to a chemiluminescent compound. The presence of the
chemiluminescent-tagged antibody is then determined by
detecting the presence of luminescence that arises during the
course of a chemical reaction. Examples of particularly
useful chemiluminescent labeling compounds are luminol,
isoluminol, theromatic acridinium ester, imidazole,
acridinium salt and oxalate ester.

Likewise, a bioluminescent compound may be used to label
the antibody of the present invention. Bioluminescence is a
type of chemiluminescence found in biological systems in
which a catalytic protein increases the efficiency of the
chemiluminescent reaction. The presence of a bioluminescent
protein is determined by detecting the presence of
luminescence. Important bioluminescent compounds for
purposes of labeling are luciferin, luciferase and aequorin.

5.8. SCREENING ASSAYS FOR COMPOUNDS
THAT MODULATE HKNG1 GENE ACTIVITY

The following assays are designed to identify compounds
that bind to a HKNG1 gene product, compounds that bind to
proteins, or portions of proteins, that interact with a HKNG1
gene product, compounds that modulate, e.g., interfere with,
the interaction of a HKNG1 gene product with proteins, and
compounds that modulate the activity of the HKNG1 gene (i.e.,
modulate the level of HKNG1 gene expression and/or modulate
the level of HKNG1 gene product activity). Assays may
additionally be utilized that identify compounds that bind to
HKNG1 gene regulatory sequences (e.g., promoter sequences;
see e.g., Platt, 1994, J. Biol. Chem. 269, 28558-28562), and
that can modulate the level of HKNG1 gene expression. Such
compounds may include, but are not limited to, small organic
molecules, such as ones that are able to cross the
blood-brain barrier, gain access to and/or entry into an
appropriate cell and affect expression of the HKNG1 gene or
some other gene involved in a HKNG1 regulatory pathway.

Methods for the identification of such proteins are
described, below, in Section 5.8.2. Such proteins may be
involved in the control and/or regulation of mood. Further,
among these compounds are compounds that affect the level of
HKNG1 gene expression and/or HKNG1 gene product activity and
that can be used in the therapeutic treatment of
HKNG1-mediated disorders, e.g., neuropsychiatric disorders
such as BAD and schizophrenia as described, below, in
Section 5.9.
Compounds may include, but are not limited to, peptides
such as, for example, soluble peptides, including but not
limited to, Ig-tailed fusion peptides, and members of random
peptide libraries (see, e.g., Lam, et al., 1991, Nature
354:82-84; Houghten, et al., 1991, Nature 354:84-86), and
combinatorial chemistry-derived molecular libraries made of
D- and/or L-configuration amino acids, phosphopeptides
(including, but not limited to members of random or partially
degenerate, directed phosphopeptide libraries; see, e.g.,
Songyang, et al., 1993, Cell 72:767-778), antibodies
(including, but not limited to, polyclonal, monoclonal,
humanized, anti-idiotypic, chimeric or single chain
antibodies, and Fab, F(ab')2 and Fab expression library
fragments, and epitope-binding fragments thereof), and small
organic or inorganic molecules.

Such compounds may further comprise compounds, in
particular drugs or members of classes or families of drugs,
known to ameliorate the symptoms of a HKNG1-mediated
disorder, e.g., a neuropsychiatric disorder such as BAD or
schizophrenia.

Such compounds include families of antidepressants such
as lithium salts, carbamazepine, valproic acid, lysergic acid
diethylamide (LSD), p-chlorophenylalanine,
p-propyldopacetamide dithiocarbamate derivatives e.g., FLA 63;
anti-anxiety drugs, e.g., diazepam; monoamine oxidase (MAO)
inhibitors, e.g., iproniazid, clorgyline, phenelzine and
isocarboxazid; biogenic amine uptake blockers, e.g.,
tricyclic antidepressants such as desipramine, imipramine and
amitriptyline; serotonin reuptake inhibitors e.g.,
fluoxetine; antipsychotic drugs such as phenothiazine
derivatives (e.g., chlorpromazine (thorazine) and
trifluopromazine), butyrophenones (e.g., haloperidol
(Haldol)), thioxanthene derivatives (e.g., chlorprothixene),
and dibenzodiazepines (e.g., clozapine); benzodiazepines;
dopaminergic agonists and antagonists e.g., L-DOPA, cocaine,
amphetamine, α-methyl-tyrosine, reserpine, tetrabenazine,
benzotropine, pargyline; noradrenergic agonists and
antagonists e.g., clonidine, phenoxybenzamine, phentolamine,
tropolone.

Compounds identified via assays such as those described
herein may be useful, for example, in elaborating the
biological function of the HKNG1 gene product and for
ameliorating HKNG1-mediated neuropsychiatric disorders, such
as BAD and schizophrenia. Assays for testing the
effectiveness of compounds identified by, for example,
techniques such as those described in Sections 5.8.1 - 5.8.3,
are discussed, below, in Section 5.8.4.

5.8.1. IN VITRO SCREENING ASSAYS FOR COMPOUNDS
THAT BIND TO THE HKNG1 GENE PRODUCT

In vitro systems may be designed to identify compounds
capable of binding the HKNG1 gene products of the invention.
Compounds identified may be useful, for example, in
modulating the activity of unimpaired and/or mutant HKNG1
gene products, may be useful in elaborating the biological
function of the HKNG1 gene product, may be utilized in
screens for identifying compounds that disrupt normal HKNG1
gene product interactions, or may in themselves disrupt such
interactions.

The principle of the assays used to identify compounds
that bind to the HKNG1 gene product involves preparing a
reaction mixture of the HKNG1 gene product and the test
compound under conditions and for a time sufficient to allow
the two components to interact and bind, thus forming a
complex that can be removed and/or detected in the reaction
mixture. These assays can be conducted in a variety of ways.
For example, one method to conduct such an assay involves
anchoring a HKNG1 gene product or a test substance onto a
solid support and detecting HKNG1 gene product/test compound
complexes formed on the solid support at the end of the
reaction. In one embodiment of such a method, the HKNG1 gene
product may be anchored onto a solid support, and the test
compound, which is not anchored, may be labeled, either
directly or indirectly.

In practice, microtiter plates are conveniently
utilized as the solid support. The anchored component may be
immobilized by non-covalent or covalent attachments.
Non-covalent attachment may be accomplished by simply coating
the solid surface with a solution of the protein and drying.
Alternatively, an immobilized antibody, preferably a
monoclonal antibody, specific for the protein to be
immobilized may be used to anchor the protein to the solid
surface. The surfaces may be prepared in advance and stored.

In order to conduct the assay, the non-immobilized
component is added to the coated surface containing the
anchored component. After the reaction is complete,
unreacted components are removed (e.g., by washing) under
conditions such that any complexes formed will remain
immobilized on the solid surface. The detection of complexes
anchored on the solid surface can be accomplished in a number
of ways. Where the previously non-immobilized component is
pre-labeled, the detection of label immobilized on the
surface indicates that complexes were formed. Where the
previously non-immobilized component is not pre-labeled, an
indirect label can be used to detect complexes anchored on
the surface; e.g., using a labeled antibody specific for the
previously non-immobilized component (the antibody, in turn,
may be directly labeled or indirectly labeled with a labeled
anti-Ig antibody).

Alternatively, a reaction can be conducted in a liquid
phase, the reaction products separated from unreacted
components, and complexes detected; e.g., using an
immobilized antibody specific for HKNG1 gene product or the
test compound to anchor any complexes formed in solution, and
a labeled antibody specific for the other component of the
possible complex to detect anchored complexes.

5.8.2. ASSAYS FOR PROTEINS THAT INTERACT
WITH HKNG1 GENE PRODUCTS

Any method suitable for detecting protein-protein
interactions may be employed for identifying HKNG1 gene
product-protein interactions.

Among the traditional methods that may be employed are
co-immunoprecipitation, cross-linking and co-purification
through gradients or chromatographic columns. Utilizing
procedures such as these allows for the identification of
proteins, including intracellular proteins, that interact
with HKNG1 gene products. Once isolated, such a protein can
be identified and can be used in conjunction with standard
techniques, to identify proteins it interacts with. For
example, at least a portion of the amino acid sequence of a
protein that interacts with the HKNG1 gene product can be
ascertained using techniques well known to those of skill in
the art, such as via the Edman degradation technique (see,
e.g., Creighton, 1983, "Proteins: Structures and Molecular
Principles," W.H. Freeman & Co., N.Y., pp. 34-49). The amino
acid sequence obtained may be used as a guide for the
generation of oligonucleotide mixtures that can be used to
screen for gene sequences encoding such proteins. Screening
may be accomplished, for example, by standard hybridization
or PCR techniques. Techniques for the generation of
oligonucleotide mixtures and the screening are well-known.
(See, e.g., Ausubel, supra, and 1990, "PCR Protocols: A
Guide to Methods and Applications," Innis, et al., eds.
Academic Press, Inc., New York).
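
The use of an amino acid sequence as a guide for generating
oligonucleotide mixtures can be sketched as follows. The
example counts how many distinct coding sequences a peptide
stretch implies under the standard genetic code, and picks
the stretch that would need the least complex mixture. The
function names and the peptide in the usage note are
hypothetical; nothing here is an HKNG1 sequence or a
procedure stated in the specification.

```python
from math import prod

# Number of codons per amino acid in the standard genetic code.
CODON_COUNTS = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2,
    "G": 4, "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2,
    "P": 4, "S": 6, "T": 4, "W": 1, "Y": 2, "V": 4,
}


def degeneracy(peptide):
    """Return the number of distinct DNA sequences encoding the peptide."""
    return prod(CODON_COUNTS[aa] for aa in peptide.upper())


def least_degenerate_window(peptide, length):
    """Return the peptide window of the given length with lowest degeneracy."""
    if not 0 < length <= len(peptide):
        raise ValueError("window length must fit inside the peptide")
    windows = [peptide[i:i + length]
               for i in range(len(peptide) - length + 1)]
    # min() keeps the first window when several share the lowest value.
    return min(windows, key=degeneracy)
```

For a hypothetical sequenced fragment LSRMWKDE, the five
residue window MWKDE implies 8 coding sequences, against 216
for LSRMW, so an oligonucleotide mixture designed on the
former would be far less complex.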

Additionally, methods may be employed that result in the
simultaneous identification of genes that encode a protein
which interacts with a HKNG1 gene product. These methods
include, for example, probing expression libraries with
labeled HKNG1 gene product, using HKNG1 gene product in a
manner similar to the well known technique of antibody
probing of λgt11 libraries.

One method that detects protein interactions in vivo,
the two-hybrid system, is described in detail for
illustration only and not by way of limitation. One version
of this system has been described (Chien, et al., 1991, Proc.
Natl. Acad. Sci. USA, 88:9578-9582) and is commercially
available from Clontech (Palo Alto, CA).

Briefly, utilizing such a system, plasmids are
constructed that encode two hybrid proteins: one consists of
the DNA-binding domain of a transcription activator protein
fused to the HKNG1 gene product and the other consists of the
transcription activator protein's activation domain fused to
an unknown protein that is encoded by a cDNA that has been
recombined into this plasmid as part of a cDNA library. The
DNA-binding domain fusion plasmid and the cDNA library are
transformed into a strain of the yeast Saccharomyces
cerevisiae that contains a reporter gene (e.g., HBS or lacZ)
whose regulatory region contains the transcription
activator's binding site. Either hybrid protein alone cannot
activate transcription of the reporter gene: the DNA-binding
domain hybrid cannot because it does not provide activation
function and the activation domain hybrid cannot because it
cannot localize to the activator's binding sites.
Interaction of the two hybrid proteins reconstitutes the
functional activator protein and results in expression of the
reporter gene, which is detected by an assay for the reporter
gene product.

The two-hybrid system or related methodologies may be
used to screen activation domain libraries for proteins that
interact with the "bait" gene product. By way of example,
and not by way of limitation, HKNG1 gene products may be used
as the bait gene product. Total genomic or cDNA sequences
are fused to the DNA encoding an activation domain. This
library and a plasmid encoding a hybrid of a bait HKNG1 gene
product fused to the DNA-binding domain are co-transformed
into a yeast reporter strain, and the resulting transformants
are screened for those that express the reporter gene. For
example, a bait HKNG1 gene sequence, such as the open reading
frame of the HKNG1 gene, can be cloned into a vector such
that it is translationally fused to the DNA encoding the
DNA-binding domain of the GAL4 protein. These colonies are
purified and the library plasmids responsible for reporter
gene expression are isolated. DNA sequencing is then used to
identify the proteins encoded by the library plasmids.

A cDNA library of the cell line from which proteins that
interact with bait HKNG1 gene product are to be detected can
be made using methods routinely practiced in the art.
According to the particular system described herein, for
example, the cDNA fragments can be inserted into a vector
such that they are translationally fused to the
transcriptional activation domain of GAL4. Such a library
can be co-transformed along with the bait HKNG1 gene-GAL4
fusion plasmid into a yeast strain that contains a lacZ gene
driven by a promoter that contains GAL4 activation sequence.
A cDNA encoded protein, fused to a GAL4 transcriptional
activation domain that interacts with bait HKNG1 gene product
will reconstitute an active GAL4 protein and thereby drive
expression of the HIS3 gene. Colonies that express HIS3 can
be detected by their growth on petri dishes containing
semi-solid agar based media lacking histidine. The cDNA can
then be purified from these strains, and used to produce and
isolate the bait HKNG1 gene product-interacting protein using
techniques routinely practiced in the art.

5.8.3. ASSAYS FOR COMPOUNDS THAT INTERFERE WITH OR
POTENTIATE HKNG1 GENE PRODUCT MACROMOLECULE
INTERACTION

The HKNG1 gene products may, in vivo, interact with one
or more macromolecules, including intracellular
macromolecules, such as proteins. Such macromolecules may
include, but are not limited to, nucleic acid molecules and
those proteins identified via methods such as those
described, above, in Sections 5.8.1 - 5.8.2. For purposes of
this discussion, the macromolecules are referred to herein as
"binding partners". Compounds that disrupt HKNG1 gene
product binding to a binding partner may be useful in
regulating the activity of the HKNG1 gene product, especially
mutant HKNG1 gene products. Such compounds may include, but
are not limited to molecules such as peptides, and the like,
as described, for example, in Section 5.8.2 above.

The basic principle of an assay system used to identify
compounds that interfere with or potentiate the interaction
between the HKNG1 gene product and a binding partner or
partners involves preparing a reaction mixture containing the
HKNG1 gene product and the binding partner under conditions
and for a time sufficient to allow the two to interact and
bind, thus forming a complex. In order to test a compound
for inhibitory activity, the reaction mixture is prepared in
the presence and absence of the test compound. The test
compound may be initially included in the reaction mixture,
or may be added at a time subsequent to the addition of HKNG1
gene product and its binding partner. Control reaction
mixtures are incubated without the test compound or with a
compound which is known not to block complex formation. The
formation of any complexes between the HKNG1 gene product and
the binding partner is then detected. The formation of a
complex in the control reaction, but not in the reaction
mixture containing the test compound, indicates that the
compound interferes with the interaction of the HKNG1 gene
product and the binding partner. Additionally, complex
formation within reaction mixtures containing the test
compound and normal HKNG1 gene product may also be compared
to complex formation within reaction mixtures containing the
test compound and a mutant HKNG1 gene product. This
comparison may be important in those cases wherein it is
desirable to identify compounds that disrupt interactions of
mutant but not normal HKNG1 gene product.

In order to test a compound for potentiating activity,
the reaction mixture is prepared in the presence and absence
of the test compound. The test compound may be initially
included in the reaction mixture, or may be added at a time
subsequent to the addition of HKNG1 gene product and its
binding partner. Control reaction mixtures are incubated
without the test compound or with a compound which is known
not to block complex formation. The formation of any
complexes between the HKNG1 gene product and the binding
partner is then detected. Increased formation of a complex
in the reaction mixture containing the test compound, but not
in the control reaction, indicates that the compound enhances
and therefore potentiates the interaction of the HKNG1 gene
product and the binding partner. Additionally, complex
formation within reaction mixtures containing the test
compound and normal HKNG1 gene product may also be compared
to complex formation within reaction mixtures containing the
test compound and a mutant HKNG1 gene product. This
comparison may be important in those cases wherein it is
desirable to identify compounds that enhance interactions of
mutant but not normal HKNG1 gene product.
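
The comparison of test and control reaction mixtures
described above can be sketched as a simple decision rule.
The sketch assumes that complex formation is read out as a
positive numeric signal and that a relative change of 20% is
treated as meaningful. Both the 20% tolerance and the
function names are hypothetical choices made for
illustration; the specification does not state a threshold.

```python
def classify_compound(control_signal, test_signal, tolerance=0.2):
    """Compare complex formation with and without the test compound.

    Returns "interferes", "potentiates" or "no effect". The tolerance
    is an illustrative assumption, not a value from the specification.
    """
    if control_signal <= 0:
        raise ValueError("control reaction must show complex formation")
    ratio = test_signal / control_signal
    if ratio < 1 - tolerance:
        return "interferes"
    if ratio > 1 + tolerance:
        return "potentiates"
    return "no effect"


def mutant_selective(normal_pair, mutant_pair, tolerance=0.2):
    """True when the compound affects the mutant but not the normal product.

    Each pair is (control_signal, test_signal) for that gene product.
    """
    normal = classify_compound(*normal_pair, tolerance=tolerance)
    mutant = classify_compound(*mutant_pair, tolerance=tolerance)
    return normal == "no effect" and mutant != "no effect"
```

Under these assumptions, a control signal of 100 against a
test signal of 30 is classified as interference, and a
compound that leaves the normal gene product reaction
unchanged while reducing the mutant reaction is flagged as
mutant selective.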

In alternative embodiments, the above assays may be
performed using a reaction mixture containing the HKNG1 gene
product, a binding partner, and a third compound which
disrupts or enhances HKNG1 gene product binding to the
binding partner. The reaction mixture is prepared and
incubated in the presence and absence of the test compound,
as described above, and the formation of any complexes
between the HKNG1 gene product and the binding partner is
detected. In this embodiment, the formation of a complex in
the reaction mixture containing the test compound, but not in
the control reaction, indicates that the test compound
interferes with the ability of the third compound to disrupt
HKNG1 gene product binding to its binding partner.

The assays for compounds that interfere with or
potentiate the interaction of the HKNG1 gene products and
binding partners can be conducted in a heterogeneous or
homogeneous format. Heterogeneous assays involve anchoring
either the HKNG1 gene product or the binding partner onto a
solid support and detecting complexes formed on the solid
support at the end of the reaction. In homogeneous assays,
the entire reaction is carried out in a liquid phase. In
either approach, the order of addition of reactants can be
varied to obtain different information about the compounds
being tested. For example, test compounds that interfere
with or potentiate the interaction between the HKNG1 gene
products and the binding partners, e.g., by competition, can
be identified by conducting the reaction in the presence of
the test substance; i.e., by adding the test substance to the
reaction mixture prior to or simultaneously with the HKNG1
gene product and interactive intracellular binding partner.
Alternatively, test compounds that disrupt preformed
complexes, e.g., compounds with higher binding constants that
displace one of the components from the complex, can be
tested by adding the test compound to the reaction mixture
after complexes have been formed. The various formats are
described briefly below.

In a heterogeneous assay system, either the HKNG1 gene
product or the interactive binding partner is anchored onto
a solid surface, while the non-anchored species is labeled,
either directly or indirectly. In practice, microtiter
plates are conveniently utilized. The anchored species may
be immobilized by non-covalent or covalent attachments.
Non-covalent attachment may be accomplished simply by coating
the solid surface with a solution of the HKNG1 gene product
or binding partner and drying. Alternatively, an immobilized
antibody specific for the species to be anchored may be used
to anchor the species to the solid surface. The surfaces may
be prepared in advance and stored.
In order to conduct the assay, the partner of the
immobilized species is exposed to the coated surface with or
without the test compound. After the reaction is complete,
unreacted components are removed (e.g., by washing) and any
complexes formed will remain immobilized on the solid
surface. The detection of complexes anchored on the solid
surface can be accomplished in a number of ways. Where the
non-immobilized species is pre-labeled, the detection of
label immobilized on the surface indicates that complexes
were formed. Where the non-immobilized species is not
pre-labeled, an indirect label can be used to detect
complexes anchored on the surface; e.g., using a labeled
antibody specific for the initially non-immobilized species
(the antibody, in turn, may be directly labeled or indirectly
labeled with a labeled anti-Ig antibody). Depending upon the
order of addition of reaction components, test compounds that
inhibit complex formation or that disrupt preformed complexes
can be detected.

Alternatively, the reaction can be conducted in a liquid
phase in the presence or absence of the test compound, the
reaction products separated from unreacted components, and
complexes detected; e.g., using an immobilized antibody
specific for one of the binding components to anchor any
complexes formed in solution, and a labeled antibody specific
for the other partner to detect anchored complexes. Again,
depending upon the order of addition of reactants to the
liquid phase, test compounds that inhibit complex formation
or that disrupt preformed complexes can be identified.

In an alternate embodiment of the invention, a
homogeneous assay can be used. In this approach, a preformed
complex of the HKNG1 gene product and the interactive binding
partner is prepared in which either the HKNG1 gene product or
its binding partner is labeled, but the signal generated by
the label is quenched due to complex formation (see, e.g.,
U.S. Patent No. 4,109,496 by Rubenstein which utilizes this
approach for immunoassays). The addition of a test substance
that competes with and displaces one of the species from the
preformed complex will result in the generation of a signal
above background. In this way, test substances that disrupt
HKNG1 gene product/binding partner interaction can be
identified.

In another embodiment of the invention, these same
techniques can be employed using peptide fragments that
correspond to the binding domains of the HKNG1 product and/or
the binding partner (in cases where the binding partner is a
protein), in place of one or both of the full length
proteins. Any number of methods routinely practiced in the
art can be used to identify and isolate the binding sites.
These methods include, but are not limited to, mutagenesis of
the gene encoding one of the proteins and screening for
disruption of binding in a co-immunoprecipitation assay.
Compensating mutations in the gene encoding the second
species in the complex can then be selected. Sequence
analysis of the genes encoding the respective proteins will
reveal the mutations that correspond to the region of the
protein involved in interactive binding. Alternatively, one
protein can be anchored to a solid surface using methods
described in this Section above, and allowed to interact with
and bind to its labeled binding partner, which has been
treated with a proteolytic enzyme, such as trypsin. After
washing, a short, labeled peptide comprising the binding
domain may remain associated with the solid material, which
can be isolated and identified by amino acid sequencing.
Also, once the gene coding for the segments is engineered to
express peptide fragments of the protein, it can then be
tested for binding activity and purified or synthesized.

For example, and not by way of limitation, a HKNG1 gene
product can be anchored to a solid material as described,
above, in this Section by making a GST-HKNG1 fusion protein
and allowing it to bind to glutathione agarose beads. The
binding partner can be labeled with a radioactive isotope,
such as 35S, and cleaved with a proteolytic enzyme such as
trypsin. Cleavage products can then be added to the anchored
GST-HKNG1 fusion protein and allowed to bind. After washing
away unbound peptides, labeled bound material, representing
the binding partner binding domain, can be eluted, purified,
and analyzed for amino acid sequence by well-known methods.
Peptides so identified can be produced synthetically or
produced using recombinant DNA technology.
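
The tryptic cleavage step lends itself to a short in silico
sketch that lists the peptides expected from a binding
partner sequence, which could then be matched against the
sequence of the labeled material recovered from the beads.
It applies the commonly used simplified rule that trypsin
cleaves on the carboxyl side of lysine (K) or arginine (R)
unless the next residue is proline (P). The function name
and the example sequence are hypothetical, and real digests
can include missed cleavages that this sketch ignores.

```python
def trypsin_digest(sequence):
    """Split a protein sequence into predicted tryptic peptides.

    Simplified rule: cleave after K or R, except when followed by P.
    """
    seq = sequence.upper()
    peptides = []
    start = 0
    for i, residue in enumerate(seq):
        next_residue = seq[i + 1] if i + 1 < len(seq) else ""
        if residue in "KR" and next_residue != "P":
            peptides.append(seq[start:i + 1])
            start = i + 1
    # Keep any C-terminal remainder that does not end in K or R.
    if start < len(seq):
        peptides.append(seq[start:])
    return peptides
```

For the hypothetical sequence MKRPLAKGG this returns MK,
RPLAK and GG: the arginine is not cleaved because it
precedes a proline.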

5.8.4. ASSAYS FOR IDENTIFICATION OF COMPOUNDS
THAT AMELIORATE A HKNG1-MEDIATED DISORDER

Compounds, including but not limited to binding
compounds identified via assay techniques such as those
described, above, in Sections 5.8.1 - 5.8.3, can be tested
for the ability to ameliorate symptoms of a HKNG1-mediated
disorder, e.g., a CNS-related disorder, such as a
neuropsychiatric disorder, including schizophrenia and
bipolar affective (mood) disorders, including severe bipolar
affective (mood) disorder (BP-I), bipolar affective (mood)
disorder with hypomania and major depression (BP-II), and
myopia disorders.

It should be noted that the assays described herein can
identify compounds that affect HKNG1 activity by either
affecting HKNG1 gene expression or by affecting the level of
HKNG1 gene product activity. For example, compounds may be
identified that are involved in another step in the pathway
in which the HKNG1 gene and/or HKNG1 gene product is involved
and, by affecting this same pathway, may modulate the effect
of HKNG1 on the development of a HKNG1-mediated disorder.
Such compounds can be used, e.g., as part of a therapeutic
method for the treatment of the disorder.

Described below are cell-based and animal model-based
assays for the identification of compounds exhibiting such an
ability to ameliorate symptoms of a HKNG1-mediated disorder,
e.g., a neuropsychiatric disorder, such as BAD or
schizophrenia.

First, cell-based systems can be used to identify
compounds that may act to ameliorate symptoms of a
HKNG1-mediated disorder. Such cell systems can include, for
example, recombinant or non-recombinant cells, such as cell
lines, that express the HKNG1 gene.

In utilizing such cell systems, cells that express HKNG1
may be exposed to a compound suspected of exhibiting an
ability to ameliorate symptoms of a HKNG1-mediated disorder,
e.g., a neuropsychiatric disorder, such as BAD or
schizophrenia, at a sufficient concentration and for a
sufficient time to elicit such an amelioration of such
symptoms in the exposed cells. After exposure, the cells can
be assayed to measure alterations in the expression of the
HKNG1 gene, e.g., by assaying cell lysates for HKNG1 mRNA
transcripts (e.g., by Northern analysis) or for HKNG1 gene
products expressed by the cell; compounds that modulate
expression of the HKNG1 gene are good candidates as
therapeutics.

In addition, animal-based systems or models for a
HKNG1-mediated disorder, e.g., a neuropsychiatric disorder,
for example, transgenic mice containing a human or altered
form of HKNG1 gene, may be used to identify compounds capable
of ameliorating symptoms of the disorder. Such animal models
may be used as test substrates for the identification of
drugs, pharmaceuticals, therapies and interventions. For
example, animal models may be exposed to a compound suspected
of exhibiting an ability to ameliorate symptoms, at a
sufficient concentration and for a sufficient time to elicit
such an amelioration of symptoms of a HKNG1-mediated
disorder. The response of the animals to the exposure may be
monitored by assessing the reversal of the symptoms of the
disorder.

With regard to intervention, any treatments that reverse
any aspect of symptoms of a HKNG1-mediated disorder should
be considered as candidates for human therapeutic
intervention in such disorders. Dosages of test agents may
be determined by deriving dose-response curves, as discussed
in Section 5.10.1, below.
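
As a rough illustration of how a dose-response curve can be
summarized, the sketch below estimates the dose giving a
half-maximal response by interpolating on a logarithmic dose
scale between the two measured points that bracket it. It
assumes positive doses and a response that rises with dose
from the lowest to the highest dose tested. The function
name and the figures in the usage note are hypothetical, and
this is not the procedure of Section 5.10.1, which lies
outside this passage.

```python
from math import log10


def estimate_half_maximal_dose(doses, responses):
    """Estimate the dose producing a half-maximal response.

    Interpolates linearly in log10(dose) between bracketing points.
    Assumes positive doses and a response that increases with dose.
    """
    pairs = sorted(zip(doses, responses))
    if len(pairs) < 2 or pairs[0][0] <= 0:
        raise ValueError("need at least two positive doses")
    low, high = pairs[0][1], pairs[-1][1]
    half = (low + high) / 2
    for (d0, r0), (d1, r1) in zip(pairs, pairs[1:]):
        if r0 <= half <= r1 and r1 != r0:
            fraction = (half - r0) / (r1 - r0)
            log_dose = log10(d0) + fraction * (log10(d1) - log10(d0))
            return 10 ** log_dose
    raise ValueError("responses do not cross the half-maximal level")
```

With hypothetical doses of 1, 10, 100 and 1000 units giving
responses of 0, 20, 80 and 100, the half-maximal response of
50 lies midway between the 10 and 100 unit points on the log
scale, so the estimate is about 31.6 units. A fitted
sigmoidal model would normally be preferred when enough data
points are available.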

5.9. COMPOUNDS AND METHODS FOR THE TREATMENT OF
HKNG1-MEDIATED DISORDERS

Described below are methods and compositions whereby a
HKNG1-mediated disorder described herein, e.g., a
HKNG1-mediated neuropsychiatric disorder, such as BAD or
schizophrenia, may be treated. For example, such methods can
comprise administering compounds which modulate the
expression of a mammalian HKNG1 gene and/or the synthesis or
activity of a mammalian HKNG1 gene product (e.g., a
recombinant HKNG1 gene product) so symptoms of the disorder
are ameliorated.

Alternatively, in those instances whereby the
HKNG1-mediated disorders result from HKNG1 gene mutations,
such methods can comprise supplying the subject with a
nucleic acid molecule encoding an unimpaired HKNG1 gene
product such that an unimpaired HKNG1 gene product is
expressed and symptoms of the disorder are ameliorated.

In another embodiment of methods for the treatment of
HKNG1-mediated disorders resulting from HKNG1 gene mutations,
such methods can comprise supplying the subject with a cell
comprising a nucleic acid molecule that encodes an unimpaired
HKNG1 gene product such that the cell expresses the
unimpaired HKNG1 gene product and symptoms of the disorder
are ameliorated.

In cases in which a loss of normal HKNG1 gene product
function results in the development of a HKNG1-mediated
disorder, an increase in HKNG1 gene product activity would
facilitate progress towards an asymptomatic state in
individuals exhibiting a deficient level of HKNG1 gene
expression and/or HKNG1 gene product activity. Methods for
enhancing the expression or synthesis of HKNG1 can include,
for example, methods such as those described below, in
Section 5.9.2.

Alternatively, symptoms of HKNG1-mediated
neuropsychiatric disorders may be ameliorated by
administering a compound that decreases the level of HKNG1
gene expression and/or HKNG1 gene product activity. Methods
for inhibiting or reducing the level of HKNG1 gene product
synthesis or expression can include, for example, methods
such as those described in Section 5.9.1.

In one embodiment of treatment methods, the compounds
administered comprise compounds, in particular drugs, which
ameliorate the symptoms of a disorder described herein as a
neuropsychiatric disorder, such as BAD or schizophrenia.
Such compounds include drugs within the families of
antidepressants such as lithium salts, carbamazepine,
valproic acid, lysergic acid diethylamide (LSD),
p-chlorophenylalanine, p-propyldopacetamide dithiocarbamate
derivatives e.g., FLA 63; anti-anxiety drugs, e.g., diazepam;
monoamine oxidase (MAO) inhibitors, e.g., iproniazid,
clorgyline, phenelzine and isocarboxazid; biogenic amine
uptake blockers, e.g., tricyclic antidepressants such as
desipramine, imipramine and amitriptyline; serotonin reuptake
inhibitors e.g., fluoxetine; antipsychotic drugs such as
phenothiazine derivatives (e.g., chlorpromazine (thorazine)
and trifluopromazine), butyrophenones (e.g., haloperidol
(Haldol)), thioxanthene derivatives (e.g., chlorprothixene),
and dibenzodiazepines (e.g., clozapine); benzodiazepines;
dopaminergic agonists and antagonists e.g., L-DOPA, cocaine,
amphetamine, α-methyl-tyrosine, reserpine, tetrabenazine,
benzotropine, pargyline; noradrenergic agonists and
antagonists e.g., clonidine, phenoxybenzamine, phentolamine,
tropolone.

In another embodiment, symptoms of a disorder described
herein, e.g., a HKNG1-mediated neuropsychiatric disorder such
as BAD or schizophrenia, may be ameliorated by HKNG1 protein
therapy methods, e.g., decreasing or increasing the level
and/or activity of HKNG1 using the HKNG1 protein, fusion
protein, and peptide sequences described in Section 5.2,
above, or by the administration of proteins or protein
fragments (e.g., peptides) which interact with an HKNG1 gene
or gene product and thereby inhibit or potentiate its
activity.

Such protein therapy may include, for example, the
administration of a functional HKNG1 protein or fragments of
an HKNG1 protein (e.g., peptides) which represent functional
HKNG1 domains.

In one embodiment, HKNG1 fragments or peptides
representing a functional HKNG1 binding domain are
administered to an individual such that the peptides bind to
an HKNG1 binding protein, e.g., an HKNG1 receptor. Such
fragments or peptides may serve to inhibit HKNG1 activity in
an individual by competing with, and thereby inhibiting,
binding of HKNG1 to the binding protein, thereby ameliorating
symptoms of a disorder described herein. Alternatively, such
fragments or peptides may enhance HKNG1 activity in an
individual by mimicking the function of HKNG1 in vivo,
thereby ameliorating the symptoms of a disorder described
herein.

The proteins and peptides which may be used in the
methods of the invention include synthetic (e.g., recombinant
or chemically synthesized) proteins and peptides, as well as
naturally occurring proteins and peptides. The proteins and
peptides may have both naturally occurring and non-naturally
occurring amino acid residues (e.g., D-amino acid residues)
and/or one or more non-peptide bonds (e.g., imino, ester,
hydrazide, semicarbazide, and azo bonds). The proteins or
peptides may also contain additional chemical groups (i.e.,
functional groups) present at the amino and/or carboxy
termini, such that, for example, the stability,
bioavailability, and/or inhibitory activity of the peptide is
enhanced. Exemplary functional groups include hydrophobic
groups (e.g., carbobenzoxyl, dansyl, and t-butyloxycarbonyl
groups), an acetyl group, a 9-fluorenylmethoxy-carbonyl
group, and macromolecular carrier groups (e.g., lipid-fatty
acid conjugates, polyethylene glycol, or carbohydrates)
including peptide groups.

5.9.1. INHIBITORY ANTISENSE, RIBOZYME
AND TRIPLE HELIX APPROACHES

In another embodiment, symptoms of HKNG1-mediated
neuropsychiatric disorders may be ameliorated by decreasing
the level of HKNG1 gene expression and/or HKNG1 gene product
activity by using HKNG1 gene sequences in conjunction with
well-known antisense, gene "knock-out", ribozyme and/or
triple helix methods to decrease the level of HKNG1 gene
expression. Among the compounds that may exhibit the ability
to modulate the activity, expression or synthesis of the
HKNG1 gene, including the ability to ameliorate the symptoms
of a HKNG1-mediated neuropsychiatric disorder, such as BAD or
schizophrenia, are antisense, ribozyme, and triple helix
molecules. Such molecules may be designed to reduce or
inhibit either unimpaired, or if appropriate, mutant target
gene activity. Techniques for the production and use of such
molecules are well known to those of skill in the art.

Antisense RNA and DNA molecules act to directly block
the translation of mRNA by hybridizing to targeted mRNA and
preventing protein translation. Antisense approaches involve
the design of oligonucleotides that are complementary to a
target gene mRNA. The antisense oligonucleotides will bind
to the complementary target gene mRNA transcripts and prevent
translation. Absolute complementarity, although preferred,
is not required.

A sequence "complementary" to a portion of an RNA, as
referred to herein, means a sequence having sufficient
complementarity to be able to hybridize with the RNA, forming
a stable duplex; in the case of double-stranded antisense
nucleic acids, a single strand of the duplex DNA may thus be
tested, or triplex formation may be assayed. The ability to
hybridize will depend on both the degree of complementarity
and the length of the antisense nucleic acid. Generally, the
longer the hybridizing nucleic acid, the more base mismatches
with an RNA it may contain and still form a stable duplex (or
triplex, as the case may be). One skilled in the art can
ascertain a tolerable degree of mismatch by use of standard
procedures to determine the melting point of the hybridized
complex.

In one embodiment, oligonucleotides complementary to
non-coding regions of the HKNG1 gene could be used in an
antisense approach to inhibit translation of endogenous HKNG1
mRNA. Antisense nucleic acids should be at least six
nucleotides in length, and are preferably oligonucleotides
ranging from 6 to about 50 nucleotides in length. In
specific aspects the oligonucleotide is at least 10
nucleotides, at least 17 nucleotides, at least 25 nucleotides
or at least 50 nucleotides.
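
By way of illustration only, the complementarity and length criteria discussed above may be sketched in a few lines of Python. The target segment shown is a hypothetical sequence, not a HKNG1 sequence, and the function names are assumptions introduced for this sketch; the 6 to 50 nucleotide bounds are taken from the preceding paragraph.

```python
# Illustrative sketch only: the target segment below is hypothetical and
# is not taken from the HKNG1 gene.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "U": "A", "T": "A"}


def antisense_dna(mrna_segment: str) -> str:
    """Return the DNA oligonucleotide (5'->3') that is perfectly
    complementary to an mRNA segment given 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(mrna_segment.upper()))


def count_mismatches(oligo: str, mrna_segment: str) -> int:
    """Count positions at which the oligonucleotide differs from the
    perfectly complementary oligonucleotide for the target segment."""
    perfect = antisense_dna(mrna_segment)
    if len(oligo) != len(perfect):
        raise ValueError("oligo and target segment must have equal length")
    return sum(1 for a, b in zip(oligo.upper(), perfect) if a != b)


def length_is_acceptable(oligo: str, minimum: int = 6, maximum: int = 50) -> bool:
    """Check the 6 to (about) 50 nucleotide range stated in the text."""
    return minimum <= len(oligo) <= maximum


target = "AUGGCUUCAGGCAAUCCG"  # hypothetical mRNA segment
oligo = antisense_dna(target)
print(oligo, count_mismatches(oligo, target), length_is_acceptable(oligo))
```

A mismatch count of this kind only describes sequence identity; whether a given number of mismatches is tolerable would still be ascertained experimentally, e.g., from the melting point of the hybridized complex as noted above.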

Regardless of the choice of target sequence, it is
preferred that in vitro studies are first performed to
quantitate the ability of the antisense oligonucleotide to
inhibit gene expression. It is preferred that these studies
utilize controls that distinguish between antisense gene
inhibition and nonspecific biological effects of
oligonucleotides. It is also preferred that these studies
compare levels of the target RNA or protein with that of an
internal control RNA or protein. Additionally, it is
envisioned that results obtained using the antisense
oligonucleotide are compared with those obtained using a
control oligonucleotide. It is preferred that the control
oligonucleotide is of approximately the same length as the
test oligonucleotide and that the nucleotide sequence of the
oligonucleotide differs from the antisense sequence no more
than is necessary to prevent specific hybridization to the
target sequence.

The oligonucleotides can be DNA or RNA or chimeric
mixtures or derivatives or modified versions thereof,
single-stranded or double-stranded. The oligonucleotide can be
modified at the base moiety, sugar moiety, or phosphate
backbone, for example, to improve stability of the molecule,
hybridization, etc. The oligonucleotide may include other
appended groups such as peptides (e.g., for targeting host
cell receptors in vivo), or agents facilitating transport
across the cell membrane (see, e.g., Letsinger, et al., 1989,
Proc. Natl. Acad. Sci. U.S.A. 86:6553-6556; Lemaitre, et al.,
1987, Proc. Natl. Acad. Sci. U.S.A. 84:648-652; PCT
Publication No. WO88/09810, published December 15, 1988) or
the blood-brain barrier (see, e.g., PCT Publication No.
WO89/10134, published April 25, 1988),
hybridization-triggered cleavage agents (see, e.g., Krol et al., 1988,
BioTechniques 6:958-976) or intercalating agents (see, e.g.,
Zon, 1988, Pharm. Res. 5:539-549). To this end, the
oligonucleotide may be conjugated to another molecule, e.g.,
a peptide, hybridization triggered cross-linking agent,
transport agent, hybridization-triggered cleavage agent, etc.

The antisense oligonucleotide may comprise at least one
modified base moiety which is selected from the group
including but not limited to 5-fluorouracil, 5-bromouracil,
5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine,
4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil,
5-carboxymethylaminomethyl-2-thiouridine,
5-carboxymethylaminomethyluracil, dihydrouracil,
beta-D-galactosylqueosine, inosine, N6-isopentenyladenine,
1-methylguanine, 1-methylinosine, 2,2-dimethylguanine,
2-methyladenine, 2-methylguanine, 3-methylcytosine,
5-methylcytosine, N6-adenine, 7-methylguanine,
5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil,
beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil,
5-methoxyuracil, 2-methylthio-N6-isopentenyladenine,
uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil,
queosine, 2-thiocytosine, 5-methyl-2-thiouracil,
2-thiouracil, 4-thiouracil, 5-methyluracil,
uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v),
5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl)
uracil, (acp3)w, and 2,6-diaminopurine.

The antisense oligonucleotide may also comprise at least
one modified sugar moiety selected from the group including
but not limited to arabinose, 2-fluoroarabinose, xylulose,
and hexose.






WO 99/47535 



PCT/US99/05606 






In yet another embodiment, the antisense oligonucleotide
comprises at least one modified phosphate backbone selected
from the group consisting of a phosphorothioate, a
phosphorodithioate, a phosphoramidothioate, a
phosphoramidate, a phosphordiamidate, a methylphosphonate, an
alkyl phosphotriester, and a formacetal or analog thereof.

In yet another embodiment, the antisense oligonucleotide
is an α-anomeric oligonucleotide. An α-anomeric
oligonucleotide forms specific double-stranded hybrids with
complementary RNA in which, contrary to the usual β-units,
the strands run parallel to each other (Gautier, et al.,
1987, Nucl. Acids Res. 15:6625-6641). The oligonucleotide is
a 2'-O-methylribonucleotide (Inoue, et al., 1987, Nucl. Acids
Res. 15:6131-6148), or a chimeric RNA-DNA analogue (Inoue, et
al., 1987, FEBS Lett. 215:327-330).

Oligonucleotides of the invention may be synthesized by
standard methods known in the art, e.g., by use of an
automated DNA synthesizer (such as are commercially available
from Biosearch, Applied Biosystems, etc.). As examples,
phosphorothioate oligonucleotides may be synthesized by the
method of Stein, et al. (1988, Nucl. Acids Res. 16:3209),
methylphosphonate oligonucleotides can be prepared by use of
controlled pore glass polymer supports (Sarin, et al., 1988,
Proc. Natl. Acad. Sci. U.S.A. 85:7448-7451), etc.

While antisense nucleotides complementary to the target
gene coding region sequence could be used, those
complementary to the transcribed, untranslated region are
most preferred.

Antisense molecules should be delivered to cells that
express the target gene in vivo. A number of methods have
been developed for delivering antisense DNA or RNA to cells;
e.g., antisense molecules can be injected directly into the
tissue site, or modified antisense molecules, designed to
target the desired cells (e.g., antisense linked to peptides
or antibodies that specifically bind receptors or antigens

in its entirety). For example, a mutant, non-functional
target gene (or a completely unrelated DNA sequence) flanked
by DNA homologous to the endogenous target gene (either the
coding regions or regulatory regions of the target gene) can
be used, with or without a selectable marker and/or a
negative selectable marker, to transfect cells that express
the target gene in vivo. Insertion of the DNA construct, via
targeted homologous recombination, results in inactivation of
the target gene. Such approaches are particularly suited in
the agricultural field where modifications to ES (embryonic
stem) cells can be used to generate animal offspring with an
inactive target gene (e.g., see Thomas and Capecchi, 1987 and
Thompson, 1989, supra). However this approach can be adapted
for use in humans provided the recombinant DNA constructs are
directly administered or targeted to the required site in
vivo using appropriate viral vectors.

Alternatively, endogenous target gene expression can be
reduced by targeting deoxyribonucleotide sequences
complementary to the regulatory region of the target gene
(i.e., the target gene promoter and/or enhancers) to form
triple helical structures that prevent transcription of the
target gene in target cells in the body. (See generally,
Helene, 1991, Anticancer Drug Des., 6(6):569-584; Helene, et
al., 1992, Ann. N.Y. Acad. Sci., 660:27-36; and Maher, 1992,
Bioassays 14(12):807-815).

Nucleic acid molecules to be used in triplex helix
formation for the inhibition of transcription should be
single stranded and composed of deoxynucleotides. The base
composition of these oligonucleotides must be designed to
promote triple helix formation via Hoogsteen base pairing
rules, which generally require sizeable stretches of either
purines or pyrimidines to be present on one strand of a
duplex. Nucleotide sequences may be pyrimidine-based, which
will result in TAT and CGC+ triplets across the three
associated strands of the resulting triple helix. The
pyrimidine-rich molecules provide base complementarity to a
purine-rich region of a single strand of the duplex in a
parallel orientation to that strand. In addition, nucleic
acid molecules may be chosen that are purine-rich, for
example, contain a stretch of G residues. These molecules
will form a triple helix with a DNA duplex that is rich in GC
pairs, in which the majority of the purine residues are
located on a single strand of the targeted duplex, resulting
in GGC triplets across the three strands in the triplex.
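
As a rough illustration of the requirement for a sizeable purine stretch on one strand of the duplex, the following Python sketch scans a single strand for runs of consecutive purines. The sequence shown and the minimum run length are hypothetical values chosen for this sketch; the text above does not specify a numerical threshold.

```python
import re


def purine_runs(strand: str, min_length: int = 15):
    """Return (start, end, run) for each stretch of at least `min_length`
    consecutive purines (A or G) on one strand, given 5'->3'.
    `min_length` is an assumed, adjustable threshold."""
    pattern = re.compile(r"[AG]{%d,}" % min_length)
    return [(m.start(), m.end(), m.group())
            for m in pattern.finditer(strand.upper())]


# Hypothetical regulatory-region fragment (not a HKNG1 sequence).
promoter = "CTCT" + "AGGAGAAGGGAGAGAAGGA" + "TCTC"
hits = purine_runs(promoter, min_length=10)
print(hits)
```

Each run reported in this way is only a candidate site; a pyrimidine-rich or purine-rich third strand would then be designed against it according to the pairing rules described above.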

Alternatively, the potential sequences that can be
targeted for triple helix formation may be increased by
creating a so called "switchback" nucleic acid molecule.
Switchback molecules are synthesized in an alternating 5'-3',
3'-5' manner, such that they base pair with first one strand
of a duplex and then the other, eliminating the necessity for
a sizeable stretch of either purines or pyrimidines to be
present on one strand of a duplex.

In instances wherein the antisense, ribozyme, and/or
triple helix molecules described herein are utilized to
inhibit mutant gene expression, it is possible that the
technique may so efficiently reduce or inhibit the
transcription (triple helix) and/or translation (antisense,
ribozyme) of mRNA produced by normal target gene alleles that
the concentration of normal target gene product present may
be lower than is necessary for a normal phenotype. In such
cases, to ensure that substantially normal levels of target
gene activity are maintained, nucleic acid molecules that
encode and express target gene polypeptides exhibiting normal
target gene activity may be introduced into cells via gene
therapy methods such as those described, below, in Section
5.9.2 that do not contain sequences susceptible to whatever
antisense, ribozyme, or triple helix treatments are being
utilized. Alternatively, in instances whereby the target gene
encodes an extracellular protein, it may be preferable to
co-administer normal target gene protein in order to maintain
the requisite level of target gene activity.

Anti-sense RNA and DNA, ribozyme, and triple helix
molecules of the invention may be prepared by any method
known in the art for the synthesis of DNA and RNA molecules,
as discussed above. These include techniques for chemically
synthesizing oligodeoxyribonucleotides and
oligoribonucleotides well known in the art such as, for
example, solid phase phosphoramidite chemical synthesis.
Alternatively, RNA molecules may be generated by in vitro and
in vivo transcription of DNA sequences encoding the antisense
RNA molecule. Such DNA sequences may be incorporated into a
wide variety of vectors that incorporate suitable RNA
polymerase promoters such as the T7 or SP6 polymerase
promoters. Alternatively, antisense cDNA constructs that
synthesize antisense RNA constitutively or inducibly,
depending on the promoter used, can be introduced stably into
cell lines.

5.9.2. GENE REPLACEMENT THERAPY

HKNG1 gene nucleic acid sequences, described above in
Section 5.1, can be utilized for transferring recombinant
HKNG1 nucleic acid sequences to cells and expressing said
sequences in recipient cells. Such techniques can be used,
for example, in marking cells or for the treatment of a
HKNG1-mediated neuropsychiatric disorder. Such treatment can
be in the form of gene replacement therapy. Specifically,
one or more copies of a normal HKNG1 gene or a portion of the
HKNG1 gene that directs the production of a HKNG1 gene
product exhibiting normal HKNG1 gene function, may be
inserted into the appropriate cells within a patient, using
vectors that include, but are not limited to adenovirus,
adeno-associated virus, and retrovirus vectors, in addition
to other particles that introduce DNA into cells, such as
liposomes.

Because the HKNG1 gene is expressed in the brain, such
gene replacement therapy techniques should be capable of
delivering HKNG1 gene sequences to these cell types within
patients. Thus, in one embodiment, techniques that are well
known to those of skill in the art (see, e.g., PCT
Publication No. WO89/10134, published April 25, 1988) can be
used to enable HKNG1 gene sequences to cross the blood-brain
barrier readily and to deliver the sequences to cells in the
brain. With respect to delivery that is capable of crossing
the blood-brain barrier, viral vectors such as, for example,
those described above, are preferable.

In another embodiment, techniques for delivery involve
direct administration, e.g., by stereotactic delivery of such
HKNG1 gene sequences to the site of the cells in which the
HKNG1 gene sequences are to be expressed.

Additional methods that may be utilized to increase the
overall level of HKNG1 gene expression and/or HKNG1 gene
product activity include using targeted homologous
recombination methods, discussed in Section 5.2, above, to
modify the expression characteristics of an endogenous HKNG1
gene in a cell or microorganism by inserting a heterologous
DNA regulatory element such that the inserted regulatory
element is operatively linked with the endogenous HKNG1 gene
in question. Targeted homologous recombination can thus be
used to activate transcription of an endogenous HKNG1 gene
that is "transcriptionally silent", i.e., is not normally
expressed or is normally expressed at very low levels, or to
enhance the expression of an endogenous HKNG1 gene that is
normally expressed.

Further, the overall level of HKNG1 gene expression
and/or HKNG1 gene product activity may be increased by the
introduction of appropriate HKNG1-expressing cells,
preferably autologous cells, into a patient at positions and
in numbers that are sufficient to ameliorate the symptoms of
a HKNG1-mediated neuropsychiatric disorder. Such cells may
be either recombinant or non-recombinant.

Among the cells that can be administered to increase the
overall level of HKNG1 gene expression in a patient are
normal cells, preferably brain cells, that express the HKNG1
gene. Alternatively, cells, preferably autologous cells, can
be engineered to express HKNG1 gene sequences, and may then
be introduced into a patient in positions appropriate for the
amelioration of the symptoms of a HKNG1-mediated
neuropsychiatric disorder. Alternately, cells that express
an unimpaired HKNG1 gene and that are from a MHC matched
individual can be utilized, and may include, for example,
brain cells. The expression of the HKNG1 gene sequences is
controlled by the appropriate gene regulatory sequences to
allow such expression in the necessary cell types. Such gene
regulatory sequences are well known to the skilled artisan.
Such cell-based gene therapy techniques are well known to
those skilled in the art, see, e.g., Anderson, U.S. Patent
No. 5,399,349.

When the cells to be administered are non-autologous
cells, they can be administered using well known techniques
that prevent a host immune response against the introduced
cells from developing. For example, the cells may be
introduced in an encapsulated form which, while allowing for
an exchange of components with the immediate extracellular
environment, does not allow the introduced cells to be
recognized by the host immune system.

Additionally, compounds, such as those identified via
techniques such as those described, above, in Section 5.8,
that are capable of modulating HKNG1 gene product activity
can be administered using standard techniques that are well
known to those of skill in the art. In instances in which
the compounds to be administered are to involve an
interaction with brain cells, the administration techniques
should include well known ones that allow for a crossing of
the blood-brain barrier.

5.10. PHARMACEUTICAL PREPARATIONS
AND METHODS OF ADMINISTRATION

The compounds that are determined to affect HKNG1 gene
expression or gene product activity can be administered to a
patient at therapeutically effective doses to treat or
ameliorate a HKNG1-mediated disorder or modulate a
HKNG1-related process described herein. A therapeutically
effective dose refers to that amount of the compound
sufficient to result in amelioration of symptoms of such a
disorder.

5.10.1. EFFECTIVE DOSE 

Toxicity and therapeutic efficacy of such compounds can
be determined by standard pharmaceutical procedures in cell
cultures or experimental animals, e.g., for determining the
LD50 (the dose lethal to 50% of the population) and the ED50
(the dose therapeutically effective in 50% of the
population). The dose ratio between toxic and therapeutic
effects is the therapeutic index and it can be expressed as
the ratio LD50/ED50. Compounds that exhibit large therapeutic
indices are preferred. While compounds that exhibit toxic
side effects may be used, care should be taken to design a
delivery system that targets such compounds to the site of
affected tissue in order to minimize potential damage to
uninfected cells and, thereby, reduce side effects.
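
The therapeutic index defined above is a simple ratio, and the preference for compounds with large indices amounts to ranking candidates by that ratio. The following Python sketch illustrates the arithmetic; the compound names and the LD50 and ED50 values are hypothetical and are not data from this specification.

```python
def therapeutic_index(ld50: float, ed50: float) -> float:
    """Therapeutic index as the ratio LD50/ED50 (both in the same units)."""
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("LD50 and ED50 must be positive")
    return ld50 / ed50


def rank_by_index(compounds: dict) -> list:
    """Given {name: (LD50, ED50)}, return names ordered from the largest
    therapeutic index to the smallest."""
    return sorted(compounds,
                  key=lambda name: therapeutic_index(*compounds[name]),
                  reverse=True)


# Hypothetical values in mg/kg, for illustration only.
candidates = {
    "compound_a": (500.0, 5.0),
    "compound_b": (200.0, 20.0),
}
for name in rank_by_index(candidates):
    print(name, therapeutic_index(*candidates[name]))
```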

The data obtained from the cell culture assays and
animal studies can be used in formulating a range of dosage
for use in humans. The dosage of such compounds lies
preferably within a range of circulating concentrations that
include the ED50 with little or no toxicity. The dosage may
vary within this range depending upon the dosage form
employed and the route of administration utilized. For any
compound used in the method of the invention, the
therapeutically effective dose can be estimated initially
from cell culture assays. A dose may be formulated in animal
models to achieve a circulating plasma concentration range
that includes the IC50 (i.e., the concentration of the test
compound that achieves a half-maximal inhibition of symptoms)
as determined in cell culture. Such information can be used
to more accurately determine useful doses in humans. Levels
in plasma may be measured, for example, by high performance
liquid chromatography.

As defined herein, a therapeutically effective amount
of antibody, protein, or polypeptide (i.e., an effective
dosage) ranges from about 0.001 to 30 mg/kg body weight,
preferably about 0.01 to 25 mg/kg body weight, more
preferably about 0.1 to 20 mg/kg body weight, and even more
preferably about 1 to 10 mg/kg, 2 to 9 mg/kg, 3 to 8 mg/kg, 4
to 7 mg/kg, or 5 to 6 mg/kg body weight.
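
Because the ranges above are stated per kilogram of body weight, the corresponding absolute amount scales linearly with the weight of the subject. A minimal Python sketch of that arithmetic follows; the 70 kg body weight is a hypothetical example value, and the function name is an assumption introduced here.

```python
def dose_range_mg(body_weight_kg: float,
                  low_mg_per_kg: float,
                  high_mg_per_kg: float) -> tuple:
    """Convert a per-kilogram dosage range into an absolute range in mg."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    if not 0 <= low_mg_per_kg <= high_mg_per_kg:
        raise ValueError("expected 0 <= low <= high")
    return (body_weight_kg * low_mg_per_kg, body_weight_kg * high_mg_per_kg)


# The "about 1 to 10 mg/kg" range from the text, applied to a
# hypothetical 70 kg subject.
low_mg, high_mg = dose_range_mg(70.0, 1.0, 10.0)
print(low_mg, high_mg)
```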

The skilled artisan will appreciate that certain factors
may influence the dosage required to effectively treat a
subject, including but not limited to the severity of the
disease or disorder, previous treatments, the general health
and/or age of the subject, and other diseases present.
Moreover, treatment of a subject with a therapeutically
effective amount of a protein, polypeptide, or antibody can
include a single treatment or, preferably, can include a
series of treatments. In a preferred example, a subject is
treated with antibody, protein, or polypeptide in the range
of between about 0.1 to 20 mg/kg body weight, one time per
week for between about 1 to 10 weeks, preferably between 2 to
8 weeks, more preferably between about 3 to 7 weeks, and even
more preferably for about 4, 5, or 6 weeks. It will also be
appreciated that the effective dosage of antibody, protein,
or polypeptide used for treatment may increase or decrease
over the course of a particular treatment. Changes in dosage
may result and become apparent from the results of diagnostic
assays as described herein.

5.10.2. FORMULATIONS AND USE
Pharmaceutical compositions for use in accordance with 
the present invention may be formulated in conventional 
manner using one or more physiologically acceptable carriers 
or excipients. 

Thus, the compounds and their physiologically acceptable
salts and solvates may be formulated for administration by
inhalation or insufflation (either through the mouth or the
nose) or oral, buccal, parenteral, rectal or topical
administration.

For oral administration, the pharmaceutical compositions
may take the form of, for example, tablets or capsules
prepared by conventional means with pharmaceutically
acceptable excipients such as binding agents (e.g.,
pregelatinised maize starch, polyvinylpyrrolidone or
hydroxypropyl methylcellulose); fillers (e.g., lactose,
microcrystalline cellulose or calcium hydrogen phosphate);
lubricants (e.g., magnesium stearate, talc or silica);
disintegrants (e.g., potato starch or sodium starch
glycolate); or wetting agents (e.g., sodium lauryl sulphate).
The tablets may be coated by methods well known in the art.
Liquid preparations for oral administration may take the form
of, for example, solutions, syrups or suspensions, or they
may be presented as a dry product for constitution with water
or other suitable vehicle before use. Such liquid
preparations may be prepared by conventional means with
pharmaceutically acceptable additives such as suspending
agents (e.g., sorbitol syrup, cellulose derivatives or
hydrogenated edible fats); emulsifying agents (e.g., lecithin
or acacia); non-aqueous vehicles (e.g., almond oil, oily
esters, ethyl alcohol or fractionated vegetable oils); and
preservatives (e.g., methyl or propyl-p-hydroxybenzoates or
sorbic acid). The preparations may also contain buffer
salts, flavoring, coloring and sweetening agents as
appropriate.

Preparations for oral administration may be suitably
formulated to give controlled release of the active compound.

For buccal administration the compositions may take the
form of tablets or lozenges formulated in conventional
manner.

For administration by inhalation, the compounds for use
according to the present invention are conveniently delivered
in the form of an aerosol spray presentation from pressurized
packs or a nebuliser, with the use of a suitable propellant,
e.g., dichlorodifluoromethane, trichlorofluoromethane,
dichlorotetrafluoroethane, carbon dioxide or other suitable
gas. In the case of a pressurized aerosol the dosage unit
may be determined by providing a valve to deliver a metered
amount. Capsules and cartridges of e.g., gelatin for use in
an inhaler or insufflator may be formulated containing a
powder mix of the compound and a suitable powder base such as
lactose or starch.

The compounds may be formulated for parenteral
administration by injection, e.g., by bolus injection or
continuous infusion. Formulations for injection may be
presented in unit dosage form, e.g., in ampoules or in
multi-dose containers, with an added preservative. The
compositions may take such forms as suspensions, solutions or
emulsions in oily or aqueous vehicles, and may contain
formulatory agents such as suspending, stabilizing and/or
dispersing agents. Alternatively, the active ingredient may
be in powder form for constitution with a suitable vehicle,
e.g., sterile pyrogen-free water, before use.

The compounds may also be formulated in rectal
compositions such as suppositories or retention enemas, e.g.,
containing conventional suppository bases such as cocoa
butter or other glycerides.

In certain embodiments, it may be desirable to
administer the pharmaceutical compositions of the invention
locally to the area in need of treatment. This may be
achieved by, for example, and not by way of limitation, local
infusion during surgery, topical application, e.g., in
conjunction with a wound dressing after surgery, by
injection, by means of a catheter, by means of a suppository,
or by means of an implant, said implant being of a porous,
non-porous, or gelatinous material, including membranes, such
as sialastic membranes, or fibers. In one embodiment,
administration can be by direct injection at the site (or
former site) of a malignant tumor or neoplastic or
pre-neoplastic tissue.

For topical application, the compounds may be combined
with a carrier so that an effective dosage is delivered,
based on the desired activity.

A topical formulation for treatment of some of the eye
disorders discussed infra (e.g., myopia) consists of an
effective amount of the compounds in an ophthalmologically
acceptable excipient such as buffered saline, mineral oil,
vegetable oils such as corn or arachis oil, petroleum jelly,
Miglyol 182, alcohol solutions, or liposomes or liposome-like
products. Any of these compositions may also include
preservatives, antioxidants, antibiotics, immunosuppressants,
and other biologically or pharmaceutically effective agents
which do not exert a detrimental effect on the compound.

In addition to the formulations described previously,
the compounds may also be formulated as a depot preparation.
Such long acting formulations may be administered by
implantation (for example subcutaneously or intramuscularly)
or by intramuscular injection. Thus, for example, the
compounds may be formulated with suitable polymeric or
hydrophobic materials (for example as an emulsion in an
acceptable oil) or ion exchange resins, or as sparingly
soluble derivatives, for example, as a sparingly soluble
salt.

The compositions may, if desired, be presented in a pack
or dispenser device that may contain one or more unit dosage
forms containing the active ingredient. The pack may for
example comprise metal or plastic foil, such as a blister
pack. The pack or dispenser device may be accompanied by
instructions for administration.

6. EXAMPLE: THE HKNG1 GENE OF CHROMOSOME 18 IS
ASSOCIATED WITH THE NEUROPSYCHIATRIC DISORDER
BAD

In the Example presented in this Section, studies are
described that define a narrow interval of approximately 27
kb on the short arm of human chromosome 18 which is
associated with the neuropsychiatric disorder BAD. The
interval is demonstrated to lie within the gene referred to
herein as the HKNG1 gene.



6.1. MATERIALS AND METHODS
6.1.1. LINKAGE DISEQUILIBRIUM
Linkage disequilibrium (LD) studies were performed using
DNA from a population sample of neuropsychiatric disorder
(BP-I) patients. The population sample and LD techniques
were as described in Escamilla et al., 1996, Am. J. Med.
Genet. 57:244-253. The present LD study took advantage of
the additional population sample collection and the
additional physical markers identified via the physical
mapping techniques described below.

6.1.2. YEAST ARTIFICIAL CHROMOSOME (YAC) MAPPING

For physical mapping, yeast artificial chromosomes
(YACs) containing human sequences were mapped to the region
being analyzed based on publicly available maps (Cohen et
al., 1993, C.R. Acad. Sci. 316:1484-1488). The YACs were
then ordered and contig reconstructed by performing standard
short tag sequence (STS)-content mapping with microsatellite
markers and non-polymorphic STSs available from databases
that surround the genetically defined candidate region.

6.1.3. BACTERIAL ARTIFICIAL CHROMOSOME (BAC) MAPPING

STSs from the short arm of human chromosome 18 were 
used to screen a human BAC library (Research Genetics, 
Huntsville, AL). The ends of the BACs were cloned or
directly sequenced. The end sequences were used to amplify 
the next overlapping BACs. From each BAC, additional 
microsatellites were identified. Specifically, random 
sheared libraries were prepared from overlapping BACs within 
the defined genetic interval. BAC DNA was sheared with a 
nebulizer (CIS-US Inc., Bedford, MA). Fragments in the size 
range of 600 to 1,000 bp were utilized for the sublibrary 
production. Microsatellite sequences from the sublibraries 
were identified by corresponding microsatellite probes. 
Sequences around such repeats were obtained to enable 
development of PCR primers for genomic DNA. 

6.1.4. RADIATION HYBRID (RH) MAPPING

Standard RH mapping techniques were applied to a
Stanford G3 RH mapping panel (Research Genetics, Huntsville,
AL) to order all microsatellite markers and non-polymorphic
STSs in the region being analyzed.



6.1.5. SAMPLE SEQUENCING

Random sheared libraries were made from all the BACs
within the defined genetic region. Approximately 9,000
subclones within the approximately 340 kb region containing
the BAD interval were sequenced with vector primers in order
to achieve an 8-fold sequence coverage of the region. All
sequences were processed through an automated sequence
analysis pipeline that assessed quality, removed vector
sequences and masked repetitive sequences. The resulting
sequences were then compared to public DNA and protein
databases using BLAST algorithms (Altschul, et al., 1990, J.
Molec. Biol., 215:403-410).

All sequences were contiged using Sequencher 3.0 (Gene
Code Corp.) and PHRED and PHRAP (Phil Green, University of
Washington) into a single DNA fragment of 340 kb.
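
The relationship between the number of subclones, the region size and the quoted fold coverage can be checked with simple arithmetic. The sketch below assumes the usual definition, coverage = (number of reads x mean read length) / region length. The function names are illustrative, and the implied mean read length is a back-calculation from the figures above, not a value stated in this Example.

```python
# Minimal sketch: fold coverage arithmetic for a shotgun-sequenced region.
# Assumption: coverage = reads * mean_read_length / region_length.

def fold_coverage(n_reads: int, mean_read_len_bp: float, region_len_bp: int) -> float:
    """Average number of times each base of the region is sequenced."""
    return n_reads * mean_read_len_bp / region_len_bp

def implied_read_length(n_reads: int, coverage: float, region_len_bp: int) -> float:
    """Mean usable read length needed to reach a given coverage."""
    return coverage * region_len_bp / n_reads

N_SUBCLONES = 9_000      # "approximately 9,000 subclones"
REGION_BP = 340_000      # "approximately 340 kb region"
TARGET_COVERAGE = 8.0    # "8-fold sequence coverage"

read_len = implied_read_length(N_SUBCLONES, TARGET_COVERAGE, REGION_BP)
print(f"implied mean read length: {read_len:.0f} bp")
print(f"coverage check: {fold_coverage(N_SUBCLONES, read_len, REGION_BP):.1f}x")
```

Under these assumptions, one read per subclone of roughly 300 bp of usable sequence would account for the stated coverage.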



6.2. RESULTS 

Genetic regions involved in human bipolar affective disorder
(BAD) had previously been reported to map to
portions of the long (18q) and short (18p) arms of human
chromosome 18 (Freimer et al., 1996, Neuropsychiat. Genet.
67:254-263; Freimer et al., 1996, Nature Genetics 12:436-441;
and McInnes et al., Proc. Natl. Acad. Sci. U.S.A. 93:13060-
13065).

High resolution physical mapping using YAC, BAC and RH
techniques. In order to provide the precise order of genetic
markers necessary for linkage and LD mapping, and to guide
new microsatellite marker development for finer mapping, a
high resolution physical map of the 18p candidate region was
developed using YAC, BAC and RH techniques.

For such physical mapping, first, YACs were mapped to
the chromosome 18 region being analyzed. Using the mapped
YAC contig as a framework, publicly available
markers spanning the 18p region were also mapped and contiged
with BACs. Sublibraries from the contiged BACs were
constructed, from which microsatellite marker sequences were
identified and sequenced.

To ensure development of an accurate physical map, the
radiation hybrid (RH) mapping technique was independently
applied to the region being analyzed. RH was used to order
all microsatellite markers and non-polymorphic STSs in the
region. Thus, the high resolution physical map ultimately
constructed was obtained using data from RH mapping and STS-
content mapping.

Linkage Disequilibrium. Prior to attempting to identify
gene sequences, studies were performed to further narrow the
neuropsychiatric disorder region. Specifically, a linkage
disequilibrium (LD) analysis was performed using population
samples and techniques as described in Section 6.1, above,
which took advantage of the additional physical markers
identified via the physical mapping techniques described
above.

Initial LD analysis narrowed the interval which
associates with BAD disorders to a 340 kb region of 18p. BAC
clones within this newly identified neuropsychiatric disorder
region were analyzed to identify specific genes within the
region. A combination of sample sequencing, cDNA selection
and transcription mapping analyses was used to arrange
sequences into tentative transcription units, that is,
tentatively delineating the coding sequences of genes within
this genomic region of interest.

Subsequent LD analyses further narrowed the BAD region
of 18p to a narrow interval of approximately 27 kb. This was
accomplished by identifying the maximum haplotype shared
among affected individuals using additional markers.
Statistical analysis of the entire 18p candidate region
indicated that the 27 kb haplotype was significantly elevated
in frequency among affected Costa Rican individuals
(LOD = 2.2; p = 0.0005).

This newly identified narrow interval was found to map
completely within one of the transcription units identified
as described above. The gene corresponding to this
transcription unit is referred to herein as the HKNG1 gene.
Thus, the results of the mapping analyses presented in this
Section demonstrate that the HKNG1 gene of human chromosome
18 is associated with the neuropsychiatric disorder BAD.

Analysis of the BAD interval indicated that the 27 kb
BAD disease-associated chromosomal interval identified in the
linkage disequilibrium studies is contained within an
approximately 60 kb genomic region which contains a sequence
referred to as GS4642 or rod photoreceptor protein (RPP) gene
(Shimizu-Matsumoto, A. et al., 1997, Invest. Ophthalmol. Vis.
Sci. 38:2576-2585).

7. EXAMPLE: SEQUENCE AND CHARACTERIZATION
OF THE HKNG1 GENE

As demonstrated in the Example presented in Section 6,
above, the HKNG1 gene is involved in the neuropsychiatric
disorder BAD. The results presented in this Section further
characterize the HKNG1 gene and gene product. In particular,
isolation of additional cDNA clones and analyses of genomic
and cDNA sequences have revealed both the full length HKNG1
amino acid sequence and the HKNG1 genomic intron/exon
structure. In particular, the nucleotide and predicted amino
acid sequence of the HKNG1 gene identified by these analyses
disclose new HKNG1 exon sequences, including new HKNG1
protein coding sequence, discovered herein. Further, the
expression of HKNG1 in human tissue, especially neural
tissue, is characterized by Northern and in situ
hybridization analysis. The results presented herein are
consistent with the HKNG1 gene being a gene which mediates
neuropsychiatric disorders such as BAD.

7.1. MATERIALS AND METHODS
HKNG1 cDNA Clone Isolation: Hybridization of a human
brain and kidney cDNA library was performed according to
standard techniques and identified a full-length HKNG1 cDNA
clone. In addition, a HKNG1 cDNA derived from a splice
variant was isolated, as described in Section 7.2, below.

Northern Blot Analysis: Standard RNA isolation
techniques and Northern blotting procedures were followed.
The HKNG1 probe utilized corresponds to the complementary
sequence of base pairs 1367 to 1578 of the full length HKNG1
cDNA sequence (SEQ ID NO: 1). Clontech multiple tissue
northern blots were probed. In particular, Clontech human I,
human II, human III, human fetal II, human brain II and human
brain III blots were utilized for this study.

In Situ Hybridization Analysis: Standard in situ
hybridization techniques were utilized. The HKNG1 probe
utilized corresponds to the complementary sequence of base
pairs 910 to 1422 of the full length HKNG1 cDNA sequence (SEQ
ID NO: 1). Brains for in situ hybridization analysis were
obtained from McLean Hospital (The Harvard Brain Tissue
Resource Center, Belmont, MA 02178).

Other techniques: The remaining techniques described in
Section 7.2, below, were performed according to standard
techniques or as discussed in Section 6.1, above.

7.2. RESULTS
7.2.1. HKNG1 Nucleotide and Amino Acid Sequence

A human brain cDNA library was screened and a full-
length clone of HKNG1 was isolated from this library, as
described above. By comparing the isolated cDNA sequence to
sequences in the public databases, a clone was identified
which had been previously identified as GS4642, or rod
photoreceptor protein (RPP) gene (GenBank Accession No.
D63813; Shimizu-Matsumoto, A. et al., 1997, Invest.
Ophthalmol. Vis. Sci. 38:2576-2585). Although Shimizu-
Matsumoto et al. refer to GS4642 as a full-length cDNA
sequence, the isolated HKNG1 cDNA extends approximately 200
bp beyond the 5' end of the identified GS4642 clone.

Importantly, the HKNG1 clone isolated herein reveals
that, contrary to the amino acid sequence described in
Shimizu-Matsumoto et al., the full length HKNG1 amino acid
sequence contains an additional 29 amino acid residues N-
terminal to what had previously been identified as the full-
length RPP (SEQ ID NO: 64). The full-length HKNG1 nucleotide
sequence (SEQ ID NO: 1) and the derived amino acid sequence
of the full-length HKNG1 polypeptide (SEQ ID NO: 2) encoded
by this sequence are depicted in FIG. 1A-1B.

The full-length HKNG1 polypeptide was found to contain
two clusterin similarity domains: clusterin similarity
domain 1 which corresponds to amino acid residue 134 to
amino acid residue 160, and clusterin similarity domain 2
which corresponds to amino acid residue 334 to amino acid
residue 362. Such clusterin domains are typically
characterized by five shared cysteine residues. In clusterin
domain 1, these shared cysteine residues correspond to
Cys134, Cys145, Cys148, Cys158, and Cys160. The shared
cysteine residues in clusterin domain 2 correspond to the
residues Cys334, Cys344, Cys351, Cys354, and Cys362.

Full-length HKNG1 cDNA sequence was compared with the
genomic contig completed by random sheared library
sequencing. Exon-intron boundaries were identified manually
by aligning the two sequences in Sequencher 3.0 and by
observing the conserved splice sites where the
alignments ended. This sequence comparison revealed that the
additional cDNA sequence discovered through isolation of the
full-length HKNG1 cDNA clone actually belongs within three
HKNG1 exons.

Prior to the isolation and analysis of HKNG1 cDNA
described herein, nine exons were predicted to be present
within the corresponding genomic sequence. As discovered
herein, however, the HKNG1 gene, in contrast, actually
contains 13 exons, with the new cDNA containing sequence
which corresponds to a new exon 1, exon 2 and a 5' extension
of what had previously been designated exon 1. Splice
variants, discussed in Section 9 below, also exist which
comprise additional exons 2' and 2". The genomic sequence
and intron/exon structure of the HKNG1 gene are shown in FIG.
3A-3R.

The breakdown of exons was confirmed by the perfect 
alignment of the cDNA sequence with the genomic sequence and 
by observation of expected splicing sites flanking each of 
the additional, newly discovered exons. 

HKNG1 nucleotide sequence was used to search databases
of partial sequences of cDNA clones. This search identified
a partial cDNA sequence derived from IMAGE clone R61493
having similarity to the human HKNG1 sequence. IMAGE clone
R61493 was obtained and consists of a cDNA insert, the Lafmid 
BA vector backbone, and DNA originating from the oligo dT 
primer and Hind III adaptors used in cDNA library 
construction. The Lafmid BA vector nucleotide sequence is 
available at the URL http://lmagB.rzpd.de/lafmida_sBq.html 
and descriptions of the oligo dT primer and Hind III adaptors 
are available in the GENBANK record corresponding to 
accession number R61493. 

The sequence of the cDNA insert revealed that the insert
was derived from an alternatively spliced HKNG1 mRNA variant,
referred to herein as HKNG1-V1. In particular, this HKNG1
variant is deleted for exon 3 of the full length 13 exon
HKNG1 sequence. The nucleotide sequence of this HKNG1
variant (SEQ ID NO: 3) is depicted in FIG. 2A-B. The amino
acid sequence encoded by the HKNG1 variant (SEQ ID NO: 4) is
also shown in FIG. 2A-B.

Preferably therefore, the nucleic acids of the invention
include nucleic acid molecules comprising the nucleotide
sequence of HKNG1-V1 or encoding the polypeptide encoded by
HKNG1-V1 in the absence of heterologous sequences (e.g.,
cloning vector sequences such as Lafmid BA; oligo dT primer,
and Hind III adaptor).

7.2.2. HKNG1 GENE EXPRESSION
HKNG1 gene expression was examined by Northern blot
analysis in various human tissues. A transcript of
approximately 2 kb was detected in fetal brain, lung and
kidney, and in adult brain, kidney, pancreas, prostate,
testis, ovary, stomach, thyroid, spinal cord, lymph node and
trachea. An approximately 1.5 kb transcript was also seen in
trachea. In addition, a larger transcript of approximately 5
kb was detected in all adult neural regions tested (that is,
cerebellum, cortex, medulla, spinal cord, occipital pole,
frontal lobe, temporal, putamen, amygdala, caudate nucleus,
corpus callosum, hippocampus, whole brain, substantia nigra,
subthalamic nucleus and thalamus). Once again, this is in
direct contrast to previous Northern analysis of the RPP
gene, which reported that expression was limited to the
retina (Shimizu-Matsumoto, A. et al., 1997, Invest.
Ophthalmol. Vis. Sci. 38:2576-2585).

Analysis of the HKNG1 tissue distribution was extended
through an in situ hybridization analysis. In particular,
the HKNG1 mRNA distribution in normal human brain tissue was
analyzed. The results of this analysis are depicted in FIG.
4. As summarized in FIG. 4, HKNG1 is expressed throughout
the brain, with transcripts being localized to neuronal and
grey matter cell types.

Finally, expression of HKNG1 in recombinant cells
demonstrates that the HKNG1 gene encodes a secreted
polypeptide(s).

8. A MISSENSE MUTATION WITHIN HKNG1
CORRELATES WITH BAD

The Example presented in Section 6, above, shows that
the BAD disorder maps to an interval completely contained
within the HKNG1 gene of the short arm of human chromosome
18. The Example presented in Section 7, above, characterizes
the HKNG1 gene and gene products. The results presented in
this Example further these studies by identifying a mutation
within the coding region of a HKNG1 allele of an individual
exhibiting a BAD disorder.

Thus, the results described herein demonstrate a
positive correlation between a mutation which encodes a non-
wild-type HKNG1 polypeptide and the appearance of the
neuropsychiatric disorder BAD. The results presented herein,
coupled with the results presented in Section 6, above,
identify HKNG1 as a gene which mediates neuropsychiatric
disorders such as BAD.

8.1. MATERIALS AND METHODS

Pairs of PCR primers that flank each exon (see TABLE 1,
above) were made and used to PCR amplify genomic DNA isolated
from BAD affected and normal individuals. The amplified PCR
products were analyzed using SSCP gel electrophoresis or by
DNA sequencing. The DNA sequences and SSCP patterns of the
affected individuals and controls were compared and variations
were further analyzed.



8.2. RESULTS 

In order to more definitively show that the HKNG1 gene
mediates neuropsychiatric disorders, in particular BAD, a
study was conducted to explore whether a HKNG1 mutation that
correlates with BAD could be identified.

First, exon scanning was performed on all eleven exons
of the HKNG1 gene using chromosomes isolated from three
affected and one normal individual from the Costa Rican
population utilized for the LD studies discussed in Section
6, above. No obvious mutations correlating with BAD were
found through this analysis.

Next, HKNG1 intron and 3'-untranslated regions within
the 27 kb BAD interval were scanned by SSCP and/or sequencing
for all variants among three affected and one normal
individual from the same population. Approximately 60
variants were identified after scanning approximately two-
thirds of the 27 kb genomic interval, which can be genotyped
and analyzed by haplotype sharing and LD analyses, as
described above, in order to identify ones which correlate
with bipolar affective disorder. FIG. 5 lists selected
variants identified through this study.

Exon scanning using chromosomal DNA from the general
population of Costa Rica, however, successfully identified a
HKNG1 missense mutation in an individual affected with BAD
who did not share the common diseased haplotype identified by
the LD analysis provided above. In particular, exon scanning
was done on exons 1-11 of HKNG1 nucleic acid from 129
individuals from the general population affected with BAD.
This analysis identified a point mutation in the coding
region of exon 7 not seen in individuals unaffected by
bipolar affective disorder. Specifically, the guanine
corresponding to nucleotide residue 604 of SEQ ID NO: 1 (or
nucleotide residue 550 of SEQ ID NO: 3) had mutated to an
adenine. HKNG1 protein expressed from this mutated HKNG1
allele comprises the
substitution of a lysine residue at amino acid residue 202 of
SEQ ID NO: 2 (or amino acid residue 184 of SEQ ID NO: 4) in
place of the wild-type glutamic acid residue.
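
The reported nucleotide and amino acid changes can be cross-checked against the standard genetic code. The sketch below is illustrative only: it does not use the actual SEQ ID NO: 1 sequence (not reproduced in this Section), and the function name is hypothetical. It enumerates every guanine-to-adenine change possible in the two glutamic acid codons and shows which of them yield lysine, then checks that the two coordinate systems quoted above are offset by a whole number of codons.

```python
# Minimal sketch: which G>A changes in a glutamic acid codon give lysine?
# The table below is the standard genetic code in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def g_to_a_outcomes(codon: str) -> dict:
    """Map each 0-based position holding a G to (mutant codon, amino acid)."""
    outcomes = {}
    for pos, base in enumerate(codon):
        if base == "G":
            mutant = codon[:pos] + "A" + codon[pos + 1:]
            outcomes[pos] = (mutant, CODON_TABLE[mutant])
    return outcomes

glu_codons = [codon for codon, aa in CODON_TABLE.items() if aa == "E"]
for codon in glu_codons:
    for pos, (mutant, aa) in g_to_a_outcomes(codon).items():
        print(f"{codon} -> {mutant} (codon position {pos + 1}): Glu -> {aa}")

# Offsets between the full-length and variant numbering quoted in the text.
nt_shift = 604 - 550   # nucleotide positions in SEQ ID NO: 1 vs SEQ ID NO: 3
aa_shift = 202 - 184   # residue positions in SEQ ID NO: 2 vs SEQ ID NO: 4
print(nt_shift, aa_shift, nt_shift == 3 * aa_shift)
```

Only a change at the first codon position converts glutamic acid (GAA or GAG) to lysine (AAA or AAG), which is consistent with the substitution described. The 54-nucleotide and 18-residue offsets also agree with each other.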

Additional HKNG1 polymorphisms relative to the HKNG1
wild-type sequence, and which, therefore, represent HKNG1
alleles, were identified through sequence analysis of the
HKNG1 alleles within a collection of schizophrenic patients
of mixed ethnicity from the United States and within a BAD
collection from the San Francisco area. These variants are
depicted in FIGS. 5A and 5B, respectively. Statistical
analysis indicated that there were significantly more
variants in the collection of schizophrenic patients of mixed
ethnicity from the United States and the San Francisco BAD
and Costa Rican BAD samples than in a collection of 242
controls (p < 0.05).

9. EXAMPLE: IDENTIFICATION OF ADDITIONAL
HKNG1 SPLICE VARIANTS

This example describes the isolation and identification
of three novel splice variants of the human gene HKNG1.
First, a novel HKNG1 clone was isolated from a human retinal
cDNA library. This clone, which completely lacks exon 7 of
the full length HKNG1 cDNA sequence, is referred to herein as
HKNG1Δ7. Because the deletion of exon 7 from the full length
HKNG1 sequence leads to an immediate frameshift, the clone
HKNG1Δ7 encodes a truncated form of the HKNG1 protein. The
HKNG1Δ7 cDNA sequence (SEQ ID NO: 65) is depicted in FIG. 18
along with the predicted amino acid sequence (SEQ ID NO: 66)
of the HKNG1Δ7 gene product it encodes.

Two other novel splice variants, referred to herein as
HKNG1-V2 and HKNG1-V3, were isolated and identified by using
RT-PCR analysis to isolate additional HKNG1 sequences. The
following primer sequences were used:

5'-AGTTGCGTCCCTCTCTGTTG-3' (SEQ ID NO: 67)

5'-GCTTCATGTTCCCGCTGTTA-3' (SEQ ID NO: 68)

These splice variants included additional exons between exons
2 and 3 of the full length HKNG1 sequence (SEQ ID NO: 1).
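
Basic properties of the two RT-PCR primers can be computed directly from their sequences. The sketch below reports length, GC content and a rough melting temperature from the Wallace rule, Tm = 2(A+T) + 4(G+C) degrees C. The Wallace rule is a coarse approximation for short oligonucleotides and is used here only as an illustration; the annealing conditions actually used are not stated in this Example, and the function names are hypothetical.

```python
# Minimal sketch: length, GC fraction and Wallace-rule Tm for the primers
# of SEQ ID NO: 67 and SEQ ID NO: 68, copied from the text.
PRIMERS = {
    "SEQ ID NO: 67": "AGTTGCGTCCCTCTCTGTTG",
    "SEQ ID NO: 68": "GCTTCATGTTCCCGCTGTTA",
}

def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)

def wallace_tm(seq: str) -> int:
    """Rough melting temperature in degrees C: 2*(A+T) + 4*(G+C)."""
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc

for name, seq in PRIMERS.items():
    print(f"{name}: {len(seq)} nt, GC {gc_fraction(seq):.0%}, "
          f"Wallace Tm ~{wallace_tm(seq)} C")
```

By this estimate the two primers are of equal length and closely matched in melting temperature, as would be expected for a primer pair used together.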

The RT-PCR product derived from HKNG1-V2 includes a
novel exon referred to as exon 2', whereas the RT-PCR
product derived from HKNG1-V3 includes a novel exon referred
to as exon 2". The sequences of these novel exons are
provided in Table 2 below. The nucleotide sequence of the
HKNG1-V2 RT-PCR product containing novel exon 2' is depicted
in FIG. 6A (SEQ ID NO: 36), whereas the HKNG1-V3 RT-PCR
product containing novel exon 2" is depicted in FIG. 6B (SEQ
ID NO: 37). Both exon 2' and 2" are part of the 5'-
untranslated region of the HKNG1 cDNA.



TABLE 2

Exon 2'   5'-TTCCCTCCCTTTGGAACGCAGCGTGGGCACC     (SEQ ID NO: 34)
          TGCAACGCAGAGACCACTGTATCCCCGGTGCAGA
          ATGTAATGAGTGCCTGATACATTTGCCGAATAAA
          CTATTCCAAGGGTTGAACTTGCTGGAAGCAAGAG
          AAGCACTATTCTGG-3'

Exon 2"   5'-ATOGAGTCTTGCTCTCGTTGCCCAGACTGGA     (SEQ ID NO: 35)
          GTGCACTGCTGCGATCTCAGCTCACTGCAACCTC
          TACCTCCCAGGTTCAAGCGATTCTCCTGCCTCAG
          CCTCTCGAGTGGCTGGGACTATAG-3'



10. EXAMPLE: IDENTIFICATION OF HKNG1 ORTHOLOGS

This example describes the isolation and
characterization of genes in other mammalian species which
are orthologs to human HKNG1. Specifically, both guinea pig
and bovine HKNG1 sequences are described.

10.1. GUINEA PIG HKNG1 ORTHOLOGS

A guinea pig HKNG1 ortholog, referred to as gphkng1815,
was isolated using RT-PCR. The cDNA sequence (SEQ ID NO: 38)
and predicted amino acid sequence (SEQ ID NO: 39) are depicted
in FIG. 7. Both the nucleotide and the predicted amino acid
sequence of gphkng1815 are similar to the human HKNG1
nucleotide and amino acid sequences. Specifically, the
program ALIGN v2.0 identified a 71.5% nucleotide sequence
identity and a 62.8% amino acid sequence identity using
standard parameters (Scoring Matrix: PAM120; GAP penalties:
-12/-4).
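
For readers unfamiliar with how a percent identity figure is derived from a pairwise alignment, the following sketch shows one common convention: identical columns divided by the number of aligned columns, counting columns that contain a gap in one sequence. This is an assumption for illustration; the exact denominator used by the ALIGN program is not stated in the text, and the toy sequences below are hypothetical, not HKNG1 or gphkng1815 sequence.

```python
# Minimal sketch: percent identity from two already-aligned sequences.
# Assumption: identity = matching columns / aligned columns (gap columns
# count toward the denominator unless both sequences have a gap).

def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    matches = 0
    columns = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == gap and y == gap:
            continue            # ignore columns that are gaps in both
        columns += 1
        if x == y:
            matches += 1        # x == y here implies neither is a gap
    if columns == 0:
        raise ValueError("no aligned columns")
    return 100.0 * matches / columns

# Hypothetical toy alignment: 4 identical columns out of 6.
print(f"{percent_identity('ACGT-A', 'ACCTGA'):.1f}%")
```

Other conventions (for example, dividing by the length of the shorter sequence) give slightly different values, so identity figures are best compared only when computed with the same program and parameters.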

Like the human HKNG1 polypeptide, the predicted
gphkng1815 polypeptide also contains two clusterin
similarity domains, which correspond to amino acid residues
105 to 131 (clusterin domain 1), and amino acid residues 305-
333 (clusterin domain 2), respectively. Both of these
domains contain the five conserved cysteine residues
typically associated with clusterin domains. Specifically,
these conserved cysteines correspond to Cys105, Cys116,
Cys119, Cys124 and Cys131 (clusterin similarity domain 1) and
Cys305, Cys315, Cys322, Cys325, and Cys333 (clusterin
similarity domain 2) of the gphkng1815 polypeptide sequence.

Three allelic variants of gphkng1815, referred to as
gphkng7b, gphkng7c, and gphkng7d, respectively, were also
identified by RT-PCR. Their nucleotide [SEQ ID NO: 40
(gphkng7b), SEQ ID NO: 42 (gphkng7c), and SEQ ID NO: 44
(gphkng7d)] and amino acid [SEQ ID NO: 41 (gphkng7b), SEQ ID
NO: 43 (gphkng7c), and SEQ ID NO: 45 (gphkng7d)] sequences
are depicted in FIGS. 8 through 10, respectively. Each of
these three allelic variants contains a deletion within a
region homologous to exon 7 of human HKNG1. The allelic
variants retain the open reading frame of the gene; however,
each allelic variant contains a deletion, relative to
gphkng1815, of 16, 92, and 93 amino acid residues, respectively.

An alignment of the predicted amino acid sequences of
gphkng1815, gphkng7b, gphkng7c, and gphkng7d is shown in
FIG. 14. An alignment of the predicted amino acid sequences
of the human HKNG1 gene product, the guinea pig HKNG1
ortholog gphkng1815, and the bovine HKNG1 ortholog described
in Subsection 10.2 below is shown in FIG. 16.

10.2. BOVINE HKNG1 ORTHOLOGS

Bovine orthologs of HKNG1 were also cloned by screening
a cDNA library made from pooled bovine retinal tissue using a
nucleotide sequence that corresponded to the complementary
sequence of base pairs 910-1422 of the full length human
HKNG1 cDNA sequence (SEQ ID NO: 1) as a probe. Three
independent bovine cDNA species, referred to as bhkng1,
bhkng2, and bhkng3 (SEQ ID NOS: 46 to 48, respectively) were
isolated. Each of these allelic variants contains several
single nucleotide polymorphisms (SNPs). None of the SNPs
results in an altered predicted amino acid sequence. Thus
all three bovine cDNAs encode the same predicted amino acid
sequence (SEQ ID NO: 49). These SNPs apparently reflect the
natural allelic variation of the pooled cDNA library from
which the sequences were isolated. Each of the three bovine
HKNG1 allelic variants is depicted in FIGS. 11 to 13,
respectively, along with the predicted amino acid sequence
which they encode.

The predicted bovine HKNG1 polypeptide also contains two
clusterin similarity domains, corresponding to amino acid
residues 105-131 and amino acid residues 304-332,
respectively, of SEQ ID NO: 49. Clusterin domain 1 contains
the five shared cysteine amino acid residues typically
associated with this type of domain: Cys105, Cys116, Cys119,
Cys124, and Cys131. Clusterin domain 2 of the bovine HKNG1
polypeptide contains four conserved cysteine residues:
Cys314, Cys321, Cys324, and Cys332.



11. EXPRESSION OF HUMAN HKNG1 GENE PRODUCT
Described in this example is the construction of
expression vectors and the successful expression of
recombinant human HKNG1 sequences. Expression vectors are
described both for native HKNG1 and for various HKNG1 fusion
proteins.



11.1. EXPRESSION OF HUMAN HKNG1:FLAG

A human HKNG1 flag epitope-tagged protein (HKNG1:flag)
vector was constructed by PCR followed by ligation into an
expression vector for expression in HEK 293T cells. The full
open-reading frame of the full length HKNG1 cDNA sequence (SEQ
ID NO: 5) was PCR amplified using the following primer sequences:

5' primer  5'-TTTTTCTGAATTCGCCACCATGAAAATTA      (SEQ ID NO: 52)
           AAGCAGAGAAAAACG-3'

3' primer  5'-TTTTTGTCGACTTATCACTTGTCGTCGTC      (SEQ ID NO: 53)
           GTCCTTGTAGTCCCAGGTTTTAAAATGTTCCT
           TAAAATGC-3'

The 5' primer incorporated a Kozak sequence upstream of and
including the upstream initiator methionine, and the 3' primer
included the nucleotide sequence encoding the flag epitope
DYKDDDDK (SEQ ID NO: 50) followed by a termination codon.

The sequenced DNA construct was transiently transfected
into HEK 293T cells in 150 mm plates using Lipofectamine
(GIBCO/BRL) according to the manufacturer's protocol.
Seventy-two hours post-transfection, the serum-free
conditioned medium (OptiMEM, GIBCO/BRL) was harvested and
spun and the remaining monolayer of cells was lysed using
2 mL of lysis buffer [50 mM Tris pH 8.0, 150 mM NaCl, 1%
NP-40, 0.05% SDS with "Complete" protease cocktail
(Boehringer Mannheim) diluted according to manufacturer's
instructions]. Insoluble material was pelleted before
preparation of SDS-PAGE samples.

Conditioned medium was electroblotted onto a PVDF
membrane (Novex) after separation by SDS-PAGE on 4-20%
gradient gels and probed with an M2 anti-flag polyclonal
antibody (1:500, Sigma) followed by horseradish peroxidase
(HRP) conjugated sheep anti-mouse antibody (1:5000,
Amersham), developed using chemiluminescent reagents
(Renaissance, Dupont), and exposed to autoradiography film
(Biomax MR2 film, Kodak). Flag immunoreactivity appeared as
a doublet of bands that migrated by SDS-PAGE between 60 and
95 kDa as determined by Multimark molecular weight markers
(Novex), demonstrating secretion of the HKNG1:flag protein.
The double band indicates at least two different species with
different mobilities on SDS-PAGE. Such doublets most
commonly arise with posttranslational modifications to the
protein, such as glycosylation and/or proteolysis. Treatment
with PNGase F (Oxford Glycosciences) according to the
manufacturer's directions resulted in a single band of
increased mobility, indicating that the two original bands
contain N-linked carbohydrate. When run in the absence of a
reducing agent, the relative mobility of the immunoreactive
bands was greater than 100 kDa relative to the same markers,
indicating that HKNG1:flag fusion proteins may be a disulfide
linked dimer or higher oligomer.

11.2. EXPRESSION OF HUMAN HKNG1-V1:FLAG

A human HKNG1-V1 flag epitope-tagged protein (HKNG1-
V1:flag) vector was also constructed by PCR followed by
ligation into an expression vector, pMET stop. The full
length open-reading frame of the HKNG1-V1 cDNA sequence (SEQ
ID NO: 6) was PCR amplified using the following primer
sequences:

5' primer  5'-TTTTTCTGAATTCACCATGAGGACCTGGG      (SEQ ID NO: 54)
           ACTAGAGTAAC-3'

3' primer  5'-TTTTTGTCGACTTATCACTTGTCGTCGTC      (SEQ ID NO: 53)
           GTCCTTGTAGTCCCAGGTTTTAAAATGTTCCT
           TAAAATGC-3'

The 5' primer incorporated a Kozak sequence upstream of and
including the upstream initiator methionine. The 3' primer
included the nucleotide sequence encoding the flag epitope
DYKDDDDK (SEQ ID NO: 50) followed by a termination codon.
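
The statement that the 3' primer encodes the flag epitope followed by a termination codon can be verified from the primer sequence itself. Because a 3' (reverse) primer is written as the antisense strand, the sketch below takes its reverse complement and translates all three reading frames using the standard genetic code. The helper names are illustrative, and the primer string is the SEQ ID NO: 53 sequence as printed above, joined across its line breaks.

```python
# Minimal sketch: reverse-complement the 3' primer (SEQ ID NO: 53) and
# look for the flag epitope followed by a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq: str) -> str:
    return seq.translate(COMPLEMENT)[::-1]

def translate(seq: str) -> str:
    """Translate complete codons only; '*' marks a stop codon."""
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))

PRIMER_3 = (
    "TTTTTGTCGACTTATCACTTGTCGTCGTC"
    "GTCCTTGTAGTCCCAGGTTTTAAAATGTTCCT"
    "TAAAATGC"
)
FLAG = "DYKDDDDK"

sense = reverse_complement(PRIMER_3)
frames = {offset: translate(sense[offset:]) for offset in range(3)}
flag_frames = [offset for offset, protein in frames.items() if FLAG + "*" in protein]

for offset in flag_frames:
    print(f"frame offset {offset}: {frames[offset]}")
```

In the one frame that contains the epitope, the sense strand reads through the gene-specific 3' end, then DYKDDDDK, then two consecutive stop codons, in agreement with the description of the primer.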

The sequenced DNA construct was transiently transfected
into HEK 293T cells in 150 mm plates using Lipofectamine
(GIBCO/BRL) according to the manufacturer's protocol.
Seventy-two hours post-transfection, the serum-free
conditioned medium (OptiMEM, GIBCO/BRL) was harvested and
spun and the remaining monolayer of cells was lysed using
2 mL of lysis buffer [50 mM Tris pH 8.0, 150 mM NaCl, 1%
NP-40, 0.05% SDS with "Complete" protease cocktail
(Boehringer Mannheim) diluted according to the manufacturer's
instructions]. Insoluble material was pelleted before
preparation of SDS-PAGE samples.
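
The lysis buffer composition given above can be converted into pipetting volumes with the usual C1 x V1 = C2 x V2 dilution relation. The sketch below is only an illustration: the stock concentrations (1 M Tris pH 8.0, 5 M NaCl, 10% NP-40, 10% SDS) are assumptions that are not stated in the text, and the protease cocktail is left out because its dilution follows the manufacturer's instructions.

```python
def stock_volume_ul(final_conc, stock_conc, final_volume_ul):
    """Volume of stock (in uL) needed to reach final_conc in final_volume_ul.

    Uses C1 * V1 = C2 * V2; both concentrations must share the same unit.
    """
    if stock_conc <= 0 or final_conc > stock_conc:
        raise ValueError("stock must be positive and at least as concentrated as the target")
    return final_conc * final_volume_ul / stock_conc


FINAL_VOLUME_UL = 2000.0  # 2 mL of lysis buffer, as in the text

# component -> (final concentration from the text, ASSUMED stock concentration)
RECIPE = {
    "Tris pH 8.0 (mM)": (50.0, 1000.0),
    "NaCl (mM)": (150.0, 5000.0),
    "NP-40 (%)": (1.0, 10.0),
    "SDS (%)": (0.05, 10.0),
}


def lysis_buffer_volumes(recipe=RECIPE, final_volume_ul=FINAL_VOLUME_UL):
    """Return the uL of each stock plus the water needed to reach the final volume."""
    volumes = {
        name: stock_volume_ul(final, stock, final_volume_ul)
        for name, (final, stock) in recipe.items()
    }
    volumes["water"] = final_volume_ul - sum(volumes.values())
    return volumes


if __name__ == "__main__":
    for name, volume in lysis_buffer_volumes().items():
        print(f"{name}: {volume:.1f} uL")
```

With different stock solutions only the second number in each `RECIPE` pair needs to change.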

Conditioned medium was electroblotted onto a PVDF
membrane (Novex) after separation by SDS-PAGE on 4-20%
gradient gels and probed with an M2 anti-flag polyclonal
antibody (1:500, Sigma) followed by horseradish peroxidase
(HRP) conjugated sheep anti-mouse antibody (1:5000,
Amersham), developed using chemiluminescent reagents
(Renaissance, Dupont), and exposed to autoradiography film
(Biomax MR2 film, Kodak). Flag immunoreactivity appeared as
a doublet of bands that migrated by SDS-PAGE between 60 and
95 kDa as determined by Multimark molecular weight markers
(Novex), demonstrating secretion of the HKNG1:flag protein.
When run in the absence of reducing agent, the immunoreactive
bands migrated at greater than 100 kDa relative to the same
markers, suggesting that the HKNG1-V1:flag fusion protein may
be a disulfide-linked dimer or higher oligomer.

11.3. EXPRESSION OF HUMAN HKNG1:Fc

A human HKNG1/hIgG1Fc fusion protein vector was
constructed by PCR. The full-length open reading frame of
the HKNG1 cDNA (SEQ ID NO: 5) was PCR amplified
using the following primer sequences:

5' primer    5'-TTTTTCTCTCGAGACCATGAAAATTAAAG       (SEQ ID NO: 55)
                CAGAGAAAAACG-3'

3' primer    5'-TTTTTGGATCCGCTGCTGCCCAGGTTTTA       (SEQ ID NO: 56)
                AAATGTTCCTTAAAATGC-3'

The 5' primer incorporated a Kozak sequence before the
upstream methionine, and the amplified product extended to
the amino acid residue before the stop codon. The 3' PCR
primer contained a 3 alanine linker at the junction of HKNG1
and the human IgG1 Fc domain, which starts at residues DPE.
The genomic sequence of the human IgG1 Fc domain was ligated
along with the PCR product into a pCDM8 vector (Invitrogen,
Carlsbad CA) for transient expression.
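
Primers of this kind carry their cloning features in the 5' tail, so a simple string scan is enough to sanity-check a design. The sketch below is illustrative: the recognition sequences are the standard ones for the named enzymes, but the text does not say which enzymes were used for this construct, so the set of sites scanned for, the use of the ACCATG motif as a stand-in for the Kozak sequence, and the function names are all assumptions made for this example. The 3' primer is taken as printed for SEQ ID NO: 56 in Section 11.4.

```python
# Standard recognition sequences for a few common cloning enzymes.
RECOGNITION_SITES = {
    "EcoRI": "GAATTC",
    "XhoI": "CTCGAG",
    "BamHI": "GGATCC",
    "SalI": "GTCGAC",
}

# Primers written 5'->3' (SEQ ID NO: 55 and SEQ ID NO: 56).
PRIMER_FWD = "TTTTTCTCTCGAGACCATGAAAATTAAAG" "CAGAGAAAAACG"
PRIMER_REV = "TTTTTGGATCCGCTGCTGCCCAGGTTTTA" "AAATGTTCCTTAAAATGC"


def find_sites(primer, sites=RECOGNITION_SITES):
    """Map enzyme name -> 0-based index of the first occurrence in the primer."""
    return {name: primer.find(seq) for name, seq in sites.items() if seq in primer}


def kozak_start(primer, core="ACCATG"):
    """Return the 0-based index of the ATG inside the first ACCATG motif, or -1."""
    hit = primer.find(core)
    return hit + 3 if hit >= 0 else -1


if __name__ == "__main__":
    print("forward primer sites:", find_sites(PRIMER_FWD))
    print("reverse primer sites:", find_sites(PRIMER_REV))
    print("ATG index in forward primer:", kozak_start(PRIMER_FWD))
```

The scan only reports exact matches, so it will not flag partial or degenerate sites.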

The sequenced DNA construct was transiently transfected
into HEK 293T cells in 150 mm plates using Lipofectamine
(GIBCO/BRL) according to the manufacturer's protocol.
Seventy-two hours post-transfection, the serum-free
conditioned medium (OptiMEM, GIBCO/BRL) was harvested and
spun and the remaining monolayer of cells was lysed using
2 mL of lysis buffer [50 mM Tris pH 8.0, 150 mM NaCl, 1%
NP-40, 0.05% SDS with "Complete" protease cocktail
(Boehringer Mannheim) diluted according to the manufacturer's
instructions]. Insoluble material was pelleted before
preparation of SDS-PAGE samples.

Conditioned medium was electroblotted onto a PVDF
membrane (Novex) after separation by SDS-PAGE on 4-20%
gradient gels and probed with an M2 anti-Fc polyclonal
antibody (1:500, Jackson ImmunoResearch Laboratories, Inc.)
followed by horseradish peroxidase (HRP) conjugated sheep
anti-mouse antibody (1:5000, Amersham), developed using
chemiluminescent reagents (Renaissance, Dupont), and exposed
to autoradiography film (Biomax MR2 film, Kodak). Human IgG1
Fc immunoreactivity appeared as a doublet of bands that
migrated by SDS-PAGE between the 148 and 60 kDa standards of
the Multimark molecular weight markers (Novex), demonstrating
secretion of the HKNG1:Fc fusion protein.

11.4. EXPRESSION OF HUMAN HKNG1-V1:Fc

A human HKNG1-V1/hIgG1Fc fusion protein (HKNG1-V1:Fc)
vector was also constructed by PCR. The full-length open
reading frame of HKNG1-V1 cDNA (SEQ ID NO: 6) was PCR
amplified using the following primer sequences:

5' primer    5'-TTTTTCTCTCGAGACCATGAGGACCTGGG       (SEQ ID NO: 57)
                ACTACAGTAAC-3'

3' primer    5'-TTTTTGGATCCGCTGCTGCCCAGGTTTTA       (SEQ ID NO: 56)
                AAATGTTCCTTAAAATGC-3'

The 5' primer incorporated a Kozak sequence before the
upstream methionine, and the amplified product extended to
the amino acid residue before the stop codon. The 3' PCR
primer contained a 3 alanine linker at the junction of
HKNG1-V1 and the human IgG1 Fc domain, which starts at
residues DPE. The genomic sequence of the human IgG1 Fc
domain was ligated along with the PCR product into a pCDM8
vector for transient expression.

The sequenced DNA construct was transiently transfected
into HEK 293T cells in 150 mm plates using Lipofectamine
(GIBCO/BRL) according to the manufacturer's protocol.
Seventy-two hours post-transfection, the serum-free
conditioned medium (OptiMEM, GIBCO/BRL) was harvested and
spun and the remaining monolayer of cells was lysed using
2 mL of lysis buffer [50 mM Tris pH 8.0, 150 mM NaCl, 1%
NP-40, 0.05% SDS with "Complete" protease cocktail
(Boehringer Mannheim) diluted according to the manufacturer's
instructions]. Insoluble material was pelleted before
preparation of SDS-PAGE samples.

Conditioned medium was electroblotted onto a PVDF
membrane (Novex) after separation by SDS-PAGE on 4-20%
gradient gels and probed with an anti-human Fc polyclonal
antibody (1:500, Jackson ImmunoResearch Laboratories, Inc.)
followed by horseradish peroxidase (HRP) conjugated sheep
anti-mouse antibody (1:5000, Amersham), developed using
chemiluminescent reagents (Renaissance, Dupont), and exposed
to autoradiography film (Biomax MR2 film, Kodak). Human IgG1
Fc immunoreactivity appeared as a doublet of bands that
migrated by SDS-PAGE between the 148 and 60 kDa standards of
the Multimark molecular weight markers (Novex), centered
approximately between 125 and 150 kDa, demonstrating
secretion mediated by the HKNG1 signal peptide.

11.5. EXPRESSION OF HUMAN HKNG1 Δ7:Fc

A human HKNG1 Δ7/hIgG1Fc fusion protein vector was also
constructed by PCR. The sequence of the HKNG1 Δ7 splice
variant was amplified by PCR using Exons 1
through 6 of the full length HKNG1 cDNA sequence (SEQ ID
NO:1) as a template with the following primer sequences:

5' primer    5'-TTTTTCTGAATTCACCATGAAGCCGCCAC       (SEQ ID NO: 58)
                TCTTGGTG-3'

3' primer    5'-TTTTTGGATCCGCTGCGGCCTCCGTG          (SEQ ID NO: 59)
                GTCAGGAGCTTATTTTTCACAGAGGACCAGCT
                AG-3'

The 5' primer incorporated a Kozak sequence upstream of and
including the upstream initiator methionine. The 3' primer
included the first 17 (coding) nucleotides of exon 8 followed
by nucleotides encoding a 3 alanine linker.

The genomic sequence of the human IgG1 Fc domain was
ligated along with the PCR product into a pCDM8 vector for
transient expression.

The sequenced DNA construct was transiently transfected
into HEK 293T cells in 150 mm plates using Lipofectamine
(GIBCO/BRL) according to the manufacturer's protocol.
Seventy-two hours post-transfection, the serum-free
conditioned medium (OptiMEM, GIBCO/BRL) was harvested and
spun and the remaining monolayer of cells was lysed using
2 mL of lysis buffer [50 mM Tris pH 8.0, 150 mM NaCl, 1%
NP-40, 0.05% SDS with "Complete" protease cocktail
(Boehringer Mannheim) diluted according to the manufacturer's
instructions]. Insoluble material was pelleted before
preparation of SDS-PAGE samples.

Conditioned medium was electroblotted onto a PVDF
membrane (Novex) after separation by SDS-PAGE on 4-20%
gradient gels and probed with an anti-human Fc polyclonal
antibody (1:500, Jackson ImmunoResearch Laboratories)
followed by horseradish peroxidase (HRP) conjugated sheep
anti-mouse antibody (1:5000, Amersham), developed using
chemiluminescent reagents (Renaissance, Dupont), and exposed
to autoradiography film (Biomax MR2 film, Kodak). Human IgG1
Fc immunoreactivity appeared as a band that migrated by
SDS-PAGE between 42 and 60 kDa relative to Multimark molecular
weight markers (Novex), centered approximately between 36.5
and 55.4 kDa relative to Mark 12 molecular weight markers
(Novex).

11.6. EXPRESSION OF NATIVE HUMAN HKNG1

A human HKNG1 expression vector was constructed by PCR
amplification of the human HKNG1 cDNA sequence (SEQ ID NO:1)
followed by ligation into an expression vector, pcDNA3.1
(Invitrogen, Carlsbad CA). The full open reading frame of
the HKNG1 cDNA sequence (SEQ ID NO:5) was PCR amplified using
the following primer sequences:

5' primer    5'-TTTTTCTCTCGAGGACTACAGGACACAGC       (SEQ ID NO: 60)
                TAAATCC-3'

3' primer    5'-TTTTTGGATCCTTATCACCAGGTTTTAAA       (SEQ ID NO: 61)
                ATGTTCCTTAAAATGC-3'

The 5' primer incorporated a Kozak sequence upstream of and
including the upstream initiator methionine. The 3' primer
included a tandem pair of termination codons.

The sequenced DNA construct was transiently transfected
into HEK 293T cells in 150 mm plates using Lipofectamine
(GIBCO/BRL) according to the manufacturer's protocol.
Seventy-two hours post-transfection, the serum-free
conditioned medium (OptiMEM, GIBCO/BRL) was harvested and
spun and the remaining monolayer of cells was lysed using
2 mL of lysis buffer [50 mM Tris pH 8.0, 150 mM NaCl, 1%
NP-40, 0.05% SDS with "Complete" protease cocktail
(Boehringer Mannheim) diluted according to the manufacturer's
instructions]. Insoluble material was pelleted before
preparation of SDS-PAGE samples.

Conditioned medium was electroblotted onto a PVDF
membrane (Novex) after separation by SDS-PAGE on 4-20%
gradient gels and probed with an anti-HKNG1 polyclonal
antibody (#84, 1:500) followed by horseradish peroxidase
(HRP) conjugated sheep anti-mouse antibody (1:5000,
Amersham), developed using chemiluminescent reagents
(Renaissance, Dupont), and exposed to autoradiography film
(Biomax MR2 film, Kodak). HKNG1 immunoreactivity appeared as
a doublet of bands that migrated by SDS-PAGE between 60 and
95 kDa as determined by Multimark molecular weight markers
(Novex).



11.7. EXPRESSION OF NATIVE HUMAN HKNG1-V1 

A human HKNG1-V1 expression vector was also constructed
by PCR amplification of the human HKNG1-V1 cDNA sequence (SEQ
ID NO: 3) followed by ligation into an expression vector,
pcDNA3.1. The full open reading frame of the HKNG1-V1 cDNA
sequence (SEQ ID NO: 6) was PCR amplified using the following
primer sequences:

5' primer    5'-TTTTTCTGAATTCACCATGAAGCCGCCAC       (SEQ ID NO: 62)
                TCTTGGTG-3'

5' primer    5'-TTTTTCTCTCGAGACCATGAGGACCTGGG       (SEQ ID NO: 63)
                ACTACAGTAAC-3'

3' primer    5'-TTTTTGGATCCTTATCACCAGGTTTTAAA       (SEQ ID NO: 61)
                ATGTTCCTTAAAATGC-3'

The 5' primer incorporated a Kozak sequence upstream of and
including the upstream initiator methionine. The 3' primer
included a tandem pair of termination codons.

The sequenced DNA construct was transiently transfected
into HEK 293T cells in 150 mm plates using Lipofectamine
(GIBCO/BRL) according to the manufacturer's protocol.
Seventy-two hours post-transfection, the serum-free
conditioned medium (OptiMEM, GIBCO/BRL) was harvested and
spun and the remaining monolayer of cells was lysed using
2 mL of lysis buffer [50 mM Tris pH 8.0, 150 mM NaCl, 1%
NP-40, 0.05% SDS with "Complete" protease cocktail
(Boehringer Mannheim) diluted according to the manufacturer's
instructions]. Insoluble material was pelleted before
preparation of SDS-PAGE samples.

Conditioned medium was electroblotted onto a PVDF
membrane (Novex) after separation by SDS-PAGE on 4-20%
gradient gels and probed with an M2 anti-HKNG1 polyclonal
antibody (#84, 1:500) followed by horseradish peroxidase
(HRP) conjugated sheep anti-mouse antibody (1:5000,
Amersham), developed using chemiluminescent reagents
(Renaissance, Dupont), and exposed to autoradiography film
(Biomax MR2 film, Kodak). HKNG immunoreactivity appeared as
a doublet of bands that migrated by SDS-PAGE between 70 and
95 kDa as determined by Multimark molecular weight markers
(Novex), demonstrating secretion mediated by the HKNG1 signal
peptide.



11.8. EXPRESSION OF HUMAN HKNG1:AP FUSION PROTEINS

Expression vectors were also constructed for human HKNG1
alkaline phosphatase C-terminal fusion protein (HKNG1:AP),
human HKNG1-V1 alkaline phosphatase C-terminal fusion protein
(HKNG1-V1:AP), and human HKNG1 alkaline phosphatase
N-terminal fusion protein (AP:HKNG1).

The expression vector for human HKNG1:AP was constructed
by PCR amplification followed by ligation into a vector
suitable for expression in HEK 293T cells. The full-length
open reading frame of human HKNG1 (SEQ ID NO: 5) was PCR
amplified using a 5' primer incorporating an EcoRI
restriction site followed by a Kozak sequence prior to the
upstream initiator methionine. The 3' primer included an XhoI
restriction site immediately following the final codon of
HKNG1. Thus, the open reading frame of the construct
includes the HKNG1 signal peptide and the full HKNG1 sequence
followed by the full sequence of human placental alkaline
phosphatase.

The expression vector for human HKNG1-V1:AP was
constructed by PCR amplification followed by ligation into
pMEAP3 vector. The full length open reading frame of human
HKNG1-V1 (SEQ ID NO: 6) was PCR amplified using a 5' primer
incorporating an EcoRI restriction site followed by a Kozak
sequence prior to the upstream initiator methionine. The 3'
primer included an XhoI restriction site immediately following
the final codon of HKNG1-V1. Thus, the open reading frame of
the construct includes the HKNG1-V1 signal peptide and the
full length HKNG1-V1 sequence followed by the full sequence
of human placental alkaline phosphatase.

The expression vector for human AP:HKNG1 was constructed
by PCR amplification followed by ligation into the AP-Tag3
vector reported by Cheng and Flanagan, 1994, Cell 79:157-168.
The full-length open reading frame of human HKNG1 (SEQ ID
NO: 5) was PCR amplified using a 5' primer incorporating a
BamHI restriction site prior to the nucleotides encoding the
first amino acids (i.e., APT) of the mature HKNG protein, and
a 3' primer that included an XhoI restriction site immediately
following the termination codon of HKNG1. Thus, the open
reading frame of the complete construct includes the AP
signal peptide and the full sequence of human placental
alkaline phosphatase, followed by the full HKNG1 sequence.

The sequenced DNA constructs were transiently
transfected into HEK 293T cells in 150 mm plates using
Lipofectamine (GIBCO/BRL) according to the manufacturer's
protocol. Seventy-two hours post-transfection, the serum-free
conditioned media (OptiMEM, GIBCO/BRL) were harvested, spun
and filtered. Alkaline phosphatase activity in the
conditioned media was quantitated using an enzymatic assay
kit (Phospha-Light, Tropix) according to the manufacturer's
instructions. When alkaline phosphatase fusion protein
concentrations below 2 nM were observed, conditioned medium
was concentrated by centrifugation using a 30 kDa cut-off
membrane. Conditioned medium samples before and after
concentration were analyzed by SDS-PAGE followed by Western
blot using anti-human alkaline phosphatase antibodies (1:250,
Genzyme) and chemiluminescent detection. A band at 140 kDa
was observed in concentrated supernatant of HKNG1:AP,
HKNG1-V1:AP, and AP:HKNG1 transfections. Conditioned medium
samples were adjusted to 10% fetal calf serum and stored at
4°C.

11.9. PURIFICATION OF FLAG-TAGGED HKNG1 PROTEINS

The secreted flag-tagged proteins described in
subsections 12.1 and 12.2 above were isolated by a one step
purification scheme utilizing the affinity of the flag
epitope for M2 anti-flag antibodies. The conditioned medium
was passed over an M2-biotin (Sigma)/streptavidin Poros
column (2.1 x 30 mm, PE Biosystems). The column was then
washed with PBS, pH 7.4, and flag-tagged protein was eluted
with 200 mM glycine, pH 3.0. Fractions were neutralized with
1.0 M Tris pH 8.0. Eluted fractions with 280 nm absorbance
greater than background were then analyzed on SDS-PAGE gels
and by Western blot. The fractions containing flag-tagged
protein were pooled and dialyzed in 8000 MWCO dialysis tubing
against 2 changes of 4 L PBS, pH 7.4 at 4°C with constant
stirring. The buffer-exchanged material was then sterile
filtered (0.2 µm, Millipore) and frozen at -80°C.

11.10. PURIFICATION OF HKNG1 Fc FUSION PROTEINS

The secreted Fc fusion proteins described in Subsections
12.3-12.5 above were isolated by a one step purification
scheme utilizing the affinity of the human IgG1 Fc domain for
Protein A. The conditioned medium was passed over a POROS A
column (4.6 x 100 mm, PerSeptive Biosystems); the column was
then washed with PBS, pH 7.4 and eluted with 200 mM glycine,
pH 3.0. Fractions were neutralized with 1.0 M Tris pH 8.0.
A constant flow rate of 7 ml/min was maintained throughout
the procedure. Eluted fractions with 280 nm absorbance
greater than background were then analyzed on SDS-PAGE gels
and by Western blot. The fractions containing Fc fusion
protein were pooled and dialyzed in 8000 MWCO dialysis tubing
against 2 changes of 4 L PBS, pH 7.4 at 4°C with constant
stirring. The buffer-exchanged material was then sterile
filtered (0.2 µm, Millipore) and frozen at -80°C.

12. PRODUCTION OF ANTI-HKNG1 ANTIBODIES

Described in the example presented in this Section is
the production and characterization of polyclonal and
monoclonal antibodies directed against HKNG1 proteins.

12.1. PRODUCTION OF POLYCLONAL ANTIBODIES

Polyclonal antisera were raised in rabbits against each
of the three peptides listed in Table 3 below. Each of the
peptides was derived from the HKNG1 amino acid sequence (SEQ
ID NO:2) by standard techniques (see, in particular,
Harlow & Lane, 1988, Antibodies: A Laboratory Manual, Cold
Spring Harbor Laboratory Press, the contents of which are
incorporated herein by reference in their entirety). Each of
the peptides is also represented in the HKNG1-V1 polypeptide
sequence (SEQ ID NO:4). Antisera were subsequently affinity
purified using the peptide immunogens.



TABLE 3

Antibody       Peptide/Immunogen     a.a. residues
                                     (SEQ ID NO:2)

Antibody 84    APTWKDKTAISENLK       50-64
Antibody 85    KAIEDLPKQDK           304-314
Antibody 86    KALQHFKEHFKTW         483-495



12.2. PRODUCTION OF MONOCLONAL ANTIBODIES

Monoclonal antibodies were raised in mice by standard
techniques (see, Harlow & Lane, supra) against the HKNG-Fc
fusion protein described in Section 11.3 above. Wells were
screened by ELISA for binding to the HKNG-Fc fusion protein.
Wells reacting with the Fc domain itself were identified by
ELISA for binding to an irrelevant Fc fusion protein and
discarded. HKNG-Fc specific wells were tested for their
ability to immunoprecipitate HKNG-Fc and subjected to isotype
analysis by standard techniques (Harlow & Lane, supra), and
eight wells were selected for subcloning. The isotype of the
subcloned monoclonal antibodies was confirmed and is
presented in Table 4 below.

Based on Western blotting, immunoprecipitation and
immunostaining data discussed in Subsection 12.3 below, two
monoclonal antibodies (3D17 and 4N6) were selected for large
scale production.

TABLE 4

Clone     Isotype

1F24      2a
1J18      2a
2020      1
3D17      1
3D24      2a
4N6       1
4016      2b
10C6      2a

12.3. WESTERN BLOTTING AND IMMUNOPRECIPITATION
OF RECOMBINANT HKNG PROTEIN

The polyclonal antisera and all eight monoclonal
antibodies described in subsections 12.1 and 12.2 above were
tested for their ability to recognize recombinant HKNG1
proteins on Western blots using standard techniques (see, in
particular, Harlow & Lane, 1988, Antibodies: A Laboratory
Manual, Cold Spring Harbor Laboratory Press). Polyclonal
antisera 84 and 85 and monoclonal antibodies 3D17 and 4N6
were able to recognize all forms of the mature (i.e.,
secreted) recombinant HKNG proteins tested (i.e., HKNG1:Fc,
HKNG1:flag, AP:HKNG1, and native HKNG1) in Western blots.

Table 5 indicates the ability of each monoclonal
antibody to immunoprecipitate recombinant HKNG1, as assessed
by Western blotting of immunoprecipitates with the polyclonal
antisera 84 and 85. None of the polyclonal antisera were
able to immunoprecipitate recombinant HKNG1 proteins. All
eight monoclonal antibodies immunoprecipitated HKNG1:Fc.
Immunoprecipitation of the other recombinant HKNG1 proteins
was variable.



TABLE 5 



25 



Monoclonal 


Protein 


Antibody 


HKNG1 : Fc 


HKNG1 : flag AP : HKNG1 


HKNG1 








(native) 


IF24 






-/+ 


1J18 






+/+ 


2020 




- + 




3D17 


+/+ 


+/+ 


+/+ 


3D24 


•f 






4N6 




**- + 


+ 


4016 


+ 




+/+ 


10C6 






4- 



13. CONFIRMATION OF THE HKNG N-TERMINUS 
AND DISULFIDE BOND STRUCTURE 

The experiments described in this section provide data
identifying the N-terminus of the mature secreted human HKNG
protein. The experiments also provide data identifying the
disulfide bond linkages between cysteine amino acid residues
in the mature, secreted protein.

Specifically, mature, secreted HKNG:flag, HKNG, and
HKNG:Fc recombinant proteins were produced and purified as
described in Section 11 above. The mature recombinant
proteins were digested with trypsin, and the tryptic
fragments were identified and sequenced using reverse-phase
liquid chromatography coupled with electrospray ionization
tandem mass spectrometry (LC/MS/MS). The N-terminus of all
mature secreted proteins tested was unambiguously identified
as APTWKDKT, which corresponds to the amino acid sequence
starting at alanine 50 of the HKNG1 amino acid sequence (FIG.
1; SEQ ID NO:2) or alanine 32 of the HKNG1-V1 amino acid
sequence (FIG. 2; SEQ ID NO: 4). Thus, although the cDNA
sequences of HKNG1 and HKNG1-V1 encode distinct amino acid
sequences, the mature secreted proteins produced by these two
splice variants of the human HKNG1 gene are identical, since
the alternative splicing that gives rise to HKNG1-V1 (i.e.,
the deletion of exon 3) affects only the amino acid sequence
of the proteolytically cleaved signal peptide. The amino acid
sequence of the mature secreted HKNG1 protein is shown in
FIG. 17 (SEQ ID NO: 51).

The mature secreted HKNG protein is also distinct from
the RPP amino acid sequence disclosed by Shimizu-Matsumoto et
al. (1997, Invest. Ophthalmol. Vis. Sci. 38:2576-2585).
In particular, amino acid residues 1 to 20 of the RPP amino
acid sequence disclosed in Figure 3 of Shimizu-Matsumoto et
al., supra, correspond to the cleaved signal peptide of
HKNG1-V1. The amino acid sequence of the mature secreted
form of the HKNG1 gene product is depicted in FIG. 17 (SEQ
ID NO:51).

Disulfide bond linkages for 8 of the 13 cysteine
residues in the mature, secreted HKNG protein were also
identified from LC/MS/MS of peptides recovered from tryptic
digestion of the unreduced protein. In particular, the
following disulfide bonded pairs of cysteines were identified
(numbering refers to the HKNG1 protein shown in FIG. 1; SEQ
ID NO:2):

Cys 134 to Cys 145;
Cys 148 to Cys 153;
Cys 160 to Cys 334; and
Cys 354 to Cys 362.
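
Because the mature protein begins at alanine 50 of SEQ ID NO:2, which is the same residue as alanine 32 of SEQ ID NO:4, residue numbers can be converted between the numbering schemes by constant offsets. The sketch below applies this to the disulfide pairs listed above. It assumes that the sequences are colinear downstream of the cleavage site, consistent with the statement that the mature proteins are identical; the function and constant names are hypothetical.

```python
MATURE_START_IN_HKNG1 = 50      # Ala 50 of SEQ ID NO:2 is residue 1 of the mature protein
MATURE_START_IN_HKNG1_V1 = 32   # the same alanine is Ala 32 of SEQ ID NO:4

# Disulfide-bonded cysteine pairs, in HKNG1 (SEQ ID NO:2) numbering.
DISULFIDE_PAIRS = [(134, 145), (148, 153), (160, 334), (354, 362)]
TOTAL_CYSTEINES_IN_MATURE = 13


def to_mature(pos):
    """Convert an HKNG1 (SEQ ID NO:2) position to mature-protein numbering."""
    if pos < MATURE_START_IN_HKNG1:
        raise ValueError("position lies in the cleaved signal peptide")
    return pos - MATURE_START_IN_HKNG1 + 1


def to_v1(pos):
    """Convert an HKNG1 (SEQ ID NO:2) position to HKNG1-V1 (SEQ ID NO:4) numbering."""
    return to_mature(pos) + MATURE_START_IN_HKNG1_V1 - 1


def renumber(pairs, convert):
    """Apply a position converter to every cysteine in a list of pairs."""
    return [(convert(a), convert(b)) for a, b in pairs]


def unassigned_cysteines(pairs=DISULFIDE_PAIRS, total=TOTAL_CYSTEINES_IN_MATURE):
    """Number of cysteines not accounted for by the listed disulfide pairs."""
    return total - 2 * len(pairs)


if __name__ == "__main__":
    print("mature numbering:  ", renumber(DISULFIDE_PAIRS, to_mature))
    print("HKNG1-V1 numbering:", renumber(DISULFIDE_PAIRS, to_v1))
    print("cysteines without an assigned partner:", unassigned_cysteines())
```

Keeping the pairs in SEQ ID NO:2 numbering and converting on demand avoids maintaining three copies of the same list.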

14. EXAMPLE: LOCALIZATION OF HKNG mRNA
AND PROTEIN EXPRESSION

This example describes experiments wherein the HKNG gene
product is shown to be expressed in human brain and retinal
tissue. Specifically, in situ hybridization experiments
performed using standard techniques with a probe that
corresponded to the complementary sequence of base pairs
910-1422 of the full length HKNG1 cDNA sequence (SEQ ID NO:1)
detected HKNG messenger RNA in the photoreceptor layer (outer
nuclear layer) of human retina in eyes obtained from the New
England Eye Bank.

The polyclonal antisera and all eight monoclonal
antibodies described in Section 12 above were tested for
immunostaining of human retina. Polyclonal antiserum 85 and
monoclonal antibodies 1F24, 4N6 and 4016 showed
immunostaining of HKNG protein in the photoreceptor layer and
adjacent layers of the retina. The immunostaining in these
tissues with polyclonal antiserum was blocked by the 85
peptide immunogen, but not by the other two peptide
immunogens (i.e., 84 and 86), confirming that the
immunostaining was due to HKNG protein expressed in the
photoreceptor layer.

The same antibodies were then used to localize HKNG
protein by immunostaining in sections of human and monkey
brain. HKNG protein was observed in cortical neurons in the
frontal cortex. The majority of pyramidal neurons in layers
IV-V were immunoreactive for HKNG protein. A subpopulation
of neurons was also labeled in layers I-III. HKNG
immunoreactivity was also observed in the pyramidal cell
layer of the hippocampus and in a small number of neurons in
the striatum.

These data further support the fact that HKNG is,
indeed, a gene which mediates neuropsychiatric disorders such
as BAD. Furthermore, the fact that HKNG is also expressed in
human retinal tissue suggests that the gene also plays a role
in myopia conditions. Specifically, Young et al. (1998,
American Journal of Human Genetics 53:109-119) report a
strong linkage (LOD = 9.59) for primary myopia and secondary
macular degeneration and retinal detachment in the telomeric
region of human chromosome 18p. Through fine mapping
analysis, this candidate region has been narrowed to a 7.6 cM
haplotype flanked by markers D18S59 and D18S1138 (Young et
al., supra). However, the marker D18S59 lies within the
HKNG1 gene. This fact, coupled with the finding that HKNG is
expressed at high levels in the retina, strongly suggests
that the HKNG1 gene is also responsible for human myopia
conditions and/or other eye related diseases such as primary
myopia, secondary macular degeneration, and retinal
detachment.



15. EXAMPLE: IMMATURE PROTEIN PRODUCTS OF
THE HKNG1 cDNA SEQUENCES

This section describes experiments which were performed
to determine which of the two putative initiator methionines
encoded by both the full length HKNG1 cDNA and the
alternatively spliced HKNG1-V1 cDNA are used in the synthesis
of immature HKNG1 protein. The results indicate that both
initiator methionines are used at varying levels, resulting
in the production of three different forms of the immature
HKNG1 protein, referred to herein as immature protein form 1
(IPF1), immature protein form 2 (IPF2), and immature protein
form 3 (IPF3).

Both the full length HKNG1 cDNA sequence shown in FIG. 1
(SEQ ID NO:1) and the alternatively spliced HKNG1-V1 cDNA
sequence shown in FIG. 2 (SEQ ID NO: 3) encode predicted
proteins that have methionines in close proximity to their
predicted initiator methionines. The predicted protein
sequence encoded by the full length HKNG1 cDNA sequence has a
second methionine at amino acid residue number 30 of the
amino acid sequence depicted in FIG. 1 (SEQ ID NO: 2). Thus,
although FIG. 1 indicates that the full length HKNG1 cDNA
encodes the first immature form of the HKNG1 protein depicted
in FIG. 1 (referred to herein as IPF1), the full length HKNG1
cDNA may additionally encode a second immature protein form
(referred to herein as IPF2), whose sequence (SEQ ID NO: 64)
is provided on the third line of the protein alignment
depicted in FIG. 17. IPF2 is initiated at methionine 30 of
the IPF1 protein sequence, and is identical to the RPP
polypeptide sequence taught by Shimizu-Matsumoto et al. (1997,
Invest. Ophthalmol. Vis. Sci. 38:2576-2585). Likewise, the
alternatively spliced HKNG1-V1 cDNA sequence encodes the
predicted immature protein form, referred to herein as IPF3,
depicted in FIG. 2 (SEQ ID NO: 4). However, the HKNG1-V1 cDNA
may also encode another immature protein form, identical to
IPF2, that is initiated at methionine 12 of the IPF3 protein
sequence. FIG. 17 illustrates an alignment of the three
immature HKNG1 protein sequences IPF1 (second row), IPF2
(third row), and IPF3 (bottom row). As explained in Section
13 above, the mature HKNG1 gene product secreted by cells
expressing the HKNG1 constructs described in Section 11,
above, is in fact the same cleaved product (SEQ ID NO: 51),
regardless of the immature HKNG1 protein (IPF1, IPF2, or
IPF3) from which it is produced. An alignment of the mature
secreted HKNG1 protein is therefore also depicted in FIG. 17
(top row).

Modified HKNG1:flag and HKNG1-V1:flag expression vectors
were constructed as described in Sections 12.1 and 12.2,
respectively. However, the nucleotide sequence of full
length HKNG1 was modified, using standard site directed
mutagenesis techniques, so as to introduce an additional base
pair between the upstream methionine (i.e., met 1 in SEQ ID
NO: 2) and the downstream methionine (i.e., met 30 in SEQ ID
NO: 2). The nucleotide sequence of HKNG1-V1 was likewise
modified, using standard site directed mutagenesis
techniques, to introduce an additional base between its
upstream methionine (i.e., met 1 in SEQ ID NO: 4) and
downstream methionine (i.e., met 12 in SEQ ID NO: 4). Thus,
in both modified constructs, the C-terminal flag epitope tag
was no longer in the same reading frame as the upstream
methionine but was in frame with the downstream methionine.
Consequently, exclusive translation initiation at the first
methionine of a construct would lead to the production of
non-flag immunoreactive proteins. However, exclusive
translation initiation at the second methionine of a
construct would lead to the production of flag immunoreactive
proteins.
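
The reading-frame logic behind this design can be illustrated with a toy example. The sketch below is not the HKNG1 sequence: `UNMODIFIED` is a short hypothetical open reading frame with two in-frame ATG codons and a C-terminal flag tag, and the inserted base and its position are arbitrary choices made for illustration.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

FLAG = "DYKDDDDK"
FLAG_DNA = "GACTACAAGGACGACGACGACAAG"  # one possible coding sequence for DYKDDDDK

# Hypothetical toy ORF: upstream Met, three Ala, downstream Met, two Gly, flag, stop.
UNMODIFIED = "ATG" + "GCT" * 3 + "ATG" + "GGT" * 2 + FLAG_DNA + "TGA"


def insert_base(seq, index, base="C"):
    """Insert a single base before position `index`, shifting the downstream frame."""
    return seq[:index] + base + seq[index:]


def translate_from(seq, start):
    """Translate from `start` until the first stop codon or the last complete codon."""
    protein = []
    for i in range(start, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def second_atg(seq):
    """Index of the first ATG found downstream of the initiator codon at position 0."""
    return seq.find("ATG", 3)


# One extra base between the two ATG codons, as in the modified constructs.
MODIFIED = insert_base(UNMODIFIED, 6)

if __name__ == "__main__":
    for label, seq in (("unmodified", UNMODIFIED), ("modified", MODIFIED)):
        from_first = translate_from(seq, 0)
        from_second = translate_from(seq, second_atg(seq))
        print(label, "| first Met:", FLAG in from_first, "| second Met:", FLAG in from_second)
```

In the toy modified sequence, initiation at the first ATG runs into a premature stop and never reaches the tag, while initiation at the second ATG still yields a flag-tagged product, mirroring the rationale described above.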

Unmodified HKNG1:flag, unmodified HKNG1-V1:flag,
modified HKNG1:flag, and modified HKNG1-V1:flag constructs
were transfected into cells, and their resulting gene
products were harvested, blotted onto a PVDF membrane,
probed with an M2 anti-flag polyclonal antibody, and
developed according to the methods described in Sections 12.1
and 12.2 above.

Flag immunoreactivity was detected in all four samples.
The unmodified HKNG1:flag and HKNG1-V1:flag expression
vectors produced amounts of mature secreted HKNG1:flag
protein consistent with the levels detected in Sections 12.1
and 12.2 above. Further, the flag immunoreactive band
detected for the modified HKNG1:flag construct was
indistinguishable in intensity from the band detected for the
unmodified HKNG1:flag construct, indicating that the immature
HKNG1 protein produced by full length HKNG1 cDNA is
predominantly IPF2, while IPF1 is produced by full length
HKNG1 cDNA in relatively minor amounts.

The flag immunoreactive band from the modified
HKNG1-V1:flag construct had dramatically reduced intensity
relative to the band from the unmodified HKNG1-V1:flag
construct. Thus, HKNG1-V1 produces primarily the immature
HKNG1 protein IPF3, while the immature HKNG1 protein IPF2 is
produced by HKNG1-V1 in relatively minor amounts. These
results are summarized below in Table 6.



TABLE 6

Construct    Immature Protein         Prominence

HKNG1        IPF1 (SEQ ID NO: 2)      Minor
             IPF2 (SEQ ID NO: 64)     Predominant
HKNG1-V1     IPF2 (SEQ ID NO: 64)     Minor
             IPF3 (SEQ ID NO: 4)      Predominant



Thus, the HKNG1 gene products of the invention include
gene products corresponding to the immature protein forms
IPF1 and IPF3. However, preferably the HKNG1 gene products
of the invention do not include amino acid sequences
consisting of the IPF2 sequence (SEQ ID NO: 64).

16. REFERENCES CITED 

The present invention is not to be limited in scope by
the specific embodiments described herein, which are intended
as single illustrations of individual aspects of the
invention, and functionally equivalent methods and components
are within the scope of the invention. Indeed, various
modifications of the invention, in addition to those shown
and described herein, will become apparent to those skilled in
the art from the foregoing description and accompanying
drawings.

All publications, patents, and patent applications
mentioned in this specification are herein incorporated by
reference to the same extent as if each individual
publication or patent application was specifically and
individually indicated to be incorporated by reference.

WHAT IS CLAIMED IS:

1. An isolated nucleic acid molecule comprising a 
nucleotide sequence that encodes a HKNG1 gene product 
comprising: 

(a) the amino acid sequence of SEQ ID NO: 2; 

(b) the amino acid sequence of SEQ ID NO: 4; 

(c) the amino acid sequence of SEQ ID NO: 39; 

(d) the amino acid sequence of SEQ ID NO: 41; 

(e) the amino acid sequence of SEQ ID NO: 43; 

(f) the amino acid sequence of SEQ ID NO: 45; 

(g) the amino acid sequence of SEQ ID NO: 49; or 

(h) the amino acid sequence of SEQ ID NO: 66. 

2. The isolated nucleic acid molecule of Claim 1,
wherein the isolated nucleic acid molecule comprises:

(a) the nucleotide sequence of SEQ ID NO: 1;

(b) the nucleotide sequence of SEQ ID NO: 3; 

(c) the nucleotide sequence of SEQ ID NO: 7; 

(d) the nucleotide sequence of SEQ ID NO: 34; or 

(e) the nucleotide sequence of SEQ ID NO: 35. 

3. The isolated nucleic acid molecule of Claim 1, 
wherein the isolated nucleic acid molecule comprises: 

(a) the nucleotide sequence of SEQ ID NO: 38; 

(b) the nucleotide sequence of SEQ ID NO: 40; 

(c) the nucleotide sequence of SEQ ID NO: 42; or

(d) the nucleotide sequence of SEQ ID NO: 44.

4. The isolated nucleic acid molecule of Claim 1, 
wherein the isolated nucleic acid molecule comprises: 

(a) the nucleotide sequence of SEQ ID NO: 46; 

(b) the nucleotide sequence of SEQ ID NO: 47; or 

(c) the nucleotide sequence of SEQ ID NO: 48.

5. An isolated nucleic acid molecule consisting of a
nucleotide sequence that encodes a mature HKNG1 protein
having the amino acid sequence of SEQ ID NO: 51.

6. An isolated nucleic acid molecule which hybridizes
to the complement of the nucleic acid molecule of any one of
Claims 1-5 under highly stringent conditions comprising
washing in 0.1xSSC/0.1% SDS at 68°C.

7. An isolated nucleic acid molecule which hybridizes
to the complement of the nucleic acid molecule of any one of
Claims 1-5 under stringent conditions comprising washing in
0.2xSSC/0.1% SDS at 50-65°C.

8. The isolated nucleic acid molecule of Claim 6 or 7,
wherein said isolated nucleic acid molecule encodes a
functionally equivalent HKNG1 gene product.

9. A vector comprising the nucleotide sequence of any 
one of Claims 1-5. 

10. An expression vector comprising the nucleotide
sequence of any one of Claims 1-5 operatively associated with
a regulatory nucleotide sequence controlling the expression
of the nucleotide sequence in a host cell.

11. A host cell genetically engineered to contain the
nucleotide sequence of any one of Claims 1-5.

12. A host cell genetically engineered to express the
nucleotide sequence of any one of Claims 1-5 operatively
associated with a regulatory nucleotide sequence controlling
expression of the nucleotide sequence in said host cell.

13. An isolated polypeptide comprising the amino acid
sequence of a HKNG1 gene product having:

(a) the amino acid sequence of SEQ ID NO: 2;

(b) the amino acid sequence of SEQ ID NO: 4;

(c) the amino acid sequence of SEQ ID NO: 39;

(d) the amino acid sequence of SEQ ID NO: 41;

(e) the amino acid sequence of SEQ ID NO: 43;

(f) the amino acid sequence of SEQ ID NO: 45;

(g) the amino acid sequence of SEQ ID NO: 49; or

(h) the amino acid sequence of SEQ ID NO: 66.

14. An isolated polypeptide consisting of a mature
HKNG1 gene product having the amino acid sequence of SEQ ID
NO: 51.

15. An isolated polypeptide comprising an amino acid
sequence encoded by the isolated nucleic acid molecule of
Claim 6 or 7.

16. An antibody which selectively binds to the HKNG1 
gene product of any one of Claims 13 or 14. 

17. A method for treating a HKNG1-mediated disorder in
an individual comprising administering to the individual a
compound which modulates the expression of an HKNG1 gene in
the individual.

18. The method of Claim 17, wherein the compound
inhibits or potentiates the expression of an HKNG1 gene in
the individual.

19. The method of Claim 17, wherein the compound is a
small molecule.

20. The method of Claim 17, wherein the HKNG1-mediated
disorder is a neuropsychiatric disorder.

21. The method of Claim 17, wherein the
neuropsychiatric disorder is bipolar affective disorder or
schizophrenia.

22. The method of Claim 17, wherein the HKNG1 gene
encodes a HKNG1 gene product comprising:

(a) the amino acid sequence of SEQ ID NO: 2;

(b) the amino acid sequence of SEQ ID NO: 4;

(c) the amino acid sequence of SEQ ID NO: 39;

(d) the amino acid sequence of SEQ ID NO: 41;

(e) the amino acid sequence of SEQ ID NO: 43;

(f) the amino acid sequence of SEQ ID NO: 45;

(g) the amino acid sequence of SEQ ID NO: 49;

(h) the amino acid sequence of SEQ ID NO: 51;

(i) the amino acid sequence of SEQ ID NO: 64; or

(j) the amino acid sequence of SEQ ID NO: 66.



23. The method of Claim 17, wherein the individual is a
mammal.

24. The method of Claim 23, wherein the mammal is a
human.



25. A method for treating a HKNG1-mediated disorder in
an individual comprising administering to the individual a
compound which modulates the expression or activity of a
HKNG1 gene product in the individual.

26. The method of Claim 25, wherein the compound
inhibits or potentiates the expression or activity of a HKNG1
gene product in the individual.

27. The method of Claim 25, wherein the compound is a
small molecule.

28. The method of Claim 25, wherein the HKNG1-mediated
disorder is a neuropsychiatric disorder.

29. The method of Claim 28, wherein the
neuropsychiatric disorder is bipolar affective disorder or
schizophrenia.

30. The method of Claim 25, wherein the HKNG1 gene
product comprises:

(a) the amino acid sequence of SEQ ID NO: 2;

(b) the amino acid sequence of SEQ ID NO: 4;

(c) the amino acid sequence of SEQ ID NO: 39;

(d) the amino acid sequence of SEQ ID NO: 41;

(e) the amino acid sequence of SEQ ID NO: 43;

(f) the amino acid sequence of SEQ ID NO: 45;

(g) the amino acid sequence of SEQ ID NO: 49;

(h) the amino acid sequence of SEQ ID NO: 51;

(i) the amino acid sequence of SEQ ID NO: 64; or

(j) the amino acid sequence of SEQ ID NO: 66.



31. The method of Claim 25, wherein the individual is a
mammal.

32. The method of Claim 31, wherein the mammal is a
human.



33. A method for identifying a compound which modulates
expression of an HKNG1 gene comprising:

(a) contacting a test compound to a cell that expresses
an HKNG1 gene;

(b) measuring a level of HKNG1 gene expression in the
cell;

(c) comparing the level of HKNG1 gene expression in the
cell in the presence of the test compound to a
level of HKNG1 gene expression in the cell in the
absence of the test compound,

wherein if the level of HKNG1 gene expression in the cell in
the presence of the test compound differs from the level of
expression of the HKNG1 gene in the cell in the absence of
the test compound, a compound that modulates expression of an
HKNG1 gene is identified.
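
The comparison in steps (b) and (c) can be sketched as follows. This is an illustrative sketch only: the function name classify_compound and the 1.5-fold cutoff are assumptions introduced here, since the claim requires only that the two expression levels differ and does not specify a threshold or a measurement unit.

```python
def classify_compound(level_with: float, level_without: float,
                      min_fold_change: float = 1.5) -> str:
    """Compare expression measured with and without the test compound.

    `min_fold_change` is a hypothetical cutoff for calling a difference;
    it is not taken from the claims.
    """
    if level_with <= 0 or level_without <= 0:
        raise ValueError("expression levels must be positive")
    fold = level_with / level_without
    if fold >= min_fold_change:
        return "potentiates"
    if fold <= 1 / min_fold_change:
        return "inhibits"
    return "no modulation detected"


# Made-up expression levels in arbitrary units.
print(classify_compound(30.0, 10.0))
print(classify_compound(4.0, 10.0))
print(classify_compound(11.0, 10.0))
```

A ratio was chosen over a raw difference so that the same cutoff applies regardless of the absolute expression level; in practice a statistical test over replicate measurements would likely be preferred.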



34. The method of Claim 33, wherein the HKNG1 gene
encodes an HKNG1 gene product comprising:

(a) the amino acid sequence of SEQ ID NO: 2;

(b) the amino acid sequence of SEQ ID NO: 4;

(c) the amino acid sequence of SEQ ID NO: 39;

(d) the amino acid sequence of SEQ ID NO: 41;

(e) the amino acid sequence of SEQ ID NO: 43;

(f) the amino acid sequence of SEQ ID NO: 45;

(g) the amino acid sequence of SEQ ID NO: 49;

(h) the amino acid sequence of SEQ ID NO: 51;

(i) the amino acid sequence of SEQ ID NO: 64; or

(j) the amino acid sequence of SEQ ID NO: 66.



35. The method of Claim 34, wherein the HKNG1 gene
comprises:

(a) the nucleotide sequence of SEQ ID NO: 1;

(b) the nucleotide sequence of SEQ ID NO: 3;

(c) the nucleotide sequence of SEQ ID NO: 5;

(d) the nucleotide sequence of SEQ ID NO: 6;

(e) the nucleotide sequence of SEQ ID NO: 34;

(f) the nucleotide sequence of SEQ ID NO: 35;

(g) the nucleotide sequence of SEQ ID NO: 38;

(h) the nucleotide sequence of SEQ ID NO: 40;

(i) the nucleotide sequence of SEQ ID NO: 42;

(j) the nucleotide sequence of SEQ ID NO: 44;

(k) the nucleotide sequence of SEQ ID NO: 46;

(l) the nucleotide sequence of SEQ ID NO: 47;

(m) the nucleotide sequence of SEQ ID NO: 48; or

(n) the nucleotide sequence of SEQ ID NO: 65.



36. A method for identifying a compound which modulates
expression or activity of an HKNG1 gene product comprising:

(a) contacting a test compound to a cell that expresses
an HKNG1 gene product;

(b) measuring a level of HKNG1 gene product expression
or activity in the cell;

(c) comparing the level of HKNG1 gene product
expression or activity in the cell in the presence
of the test compound to a level of HKNG1 gene
product expression or activity in the cell in the
absence of the test compound,

wherein if the level of HKNG1 gene product expression or
activity in the cell in the presence of the test compound
differs from the level of HKNG1 gene product expression or
activity in the cell in the absence of the test compound, a
compound that modulates expression or activity of an HKNG1
gene product is identified.

37. The method of Claim 36, wherein the HKNG1 gene
product comprises:

(a) the amino acid sequence of SEQ ID NO: 2;

(b) the amino acid sequence of SEQ ID NO: 4;

(c) the amino acid sequence of SEQ ID NO: 39;

(d) the amino acid sequence of SEQ ID NO: 41;

(e) the amino acid sequence of SEQ ID NO: 43;

(f) the amino acid sequence of SEQ ID NO: 45;

(g) the amino acid sequence of SEQ ID NO: 49;

(h) the amino acid sequence of SEQ ID NO: 51; or

(i) the amino acid sequence of SEQ ID NO: 64.



38. A method for identifying an individual having or at
risk of developing a HKNG1-mediated disorder comprising the
step of detecting the presence or absence of a polymorphism
that correlates with an HKNG1 allele associated with the
disorder, wherein presence of the polymorphism indicates
that the individual has or is at risk of developing the
HKNG1-mediated disorder.

39. The method of Claim 38, wherein the mutation
results in production of a protein comprising an amino acid
sequence that is different from the amino acid sequence of
SEQ ID NO: 2 or 4.

40. The method of Claim 39, wherein the mutation
results in the substitution of a lysine for a glutamic acid
at amino acid residue 202 of SEQ ID NO: 2.




41. The method of Claim 39, wherein the mutation 
results in the substitution of a lysine for a glutamic acid 
at amino acid residue 184 of SEQ ID NO: 4. 
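
The kind of single-residue substitution recited in Claims 40 and 41 (glutamic acid replaced by lysine at a stated residue) can be sketched as below. The helper name apply_substitution and the six-residue toy peptide are hypothetical and stand in for the actual sequences of SEQ ID NO: 2 and SEQ ID NO: 4, which are not reproduced here.

```python
def apply_substitution(protein: str, position: int, ref: str, alt: str) -> str:
    """Replace residue `ref` at 1-based `position` with `alt`.

    Raises ValueError if the position is out of range or if the residue
    found there is not `ref`, which guards against numbering mistakes.
    """
    if not 1 <= position <= len(protein):
        raise ValueError("position outside the protein sequence")
    found = protein[position - 1]
    if found != ref:
        raise ValueError(f"expected {ref} at residue {position}, found {found}")
    return protein[:position - 1] + alt + protein[position:]


# Hypothetical toy peptide with glutamic acid (E) at residue 4.
toy_peptide = "MKTEAL"
mutant = apply_substitution(toy_peptide, 4, "E", "K")
print(toy_peptide, "->", mutant)
```

Checking the reference residue before substituting matters here because the same mutation is numbered differently in the two protein forms (residue 202 versus residue 184).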

42. The method of Claim 36, wherein the method
comprises the step of analyzing the sequence of the coding
region of the human HKNG1 gene by preparing and sequencing
cDNA comprising a sequence that hybridizes under stringent
conditions to the complement of a nucleotide sequence which
encodes the polypeptide sequence depicted in SEQ ID NO: 2.

[Drawing sheets, including FIG. 3B and sheet 7/57: nucleotide sequences, some with deduced amino acid translations; the sequence text is illegible in the scanned source.]

CTCTCnTTAAAAAAAAAAAAAAAAAAAAAAATATATATAT^^ 

TCrTWTAAAAACAA^CJAATGATTAAGTAKATTATC^ 

ATAATGACTTCOTAACrrACTTTAACUVAGAGCT^ 

CTGACGCTCAGTGTCCAAGGTCCAATGAACT^ 

*CACACTGOkOGGX?n^CCTGTTAAGA 

ACATTTTAAATTATTCATGAaACCr^ 

TACTTCTGAATTTTGGAGCXXH^^^ 

CXJAACTGGTACTCTGAAAGAGAAAATGAGAAAT^ 

GATAGACACACACACyiCACACACACACACACACA(^ 

CCCTCTCTTCCTCTCOT^^ 

TIWTTTTTWGAGACAOAOTC^ 

'CAAATGATTCTTGTGCCTCa^GCCT^^ 

AC3UXX^TGaCCCACCACACTG^^ 

ACAATCAAAGTCTATAGGTTQOAaAOa^^ 

VUCIGMXXSltSCTQQQAAG^ 

AOGMGACCAGCTCAGC^^ 
AAOOQCTCAGGTCTAGAGTCWX*^^ 
TTMSCCTCMATCRCTC^^ 
GGAOAGItGGAGAGTCTTGCTT^ 
AC1TCTTCAGGGGCAATATACCCTCTTCT 
TAGCACTCAACranX?TAAGTTG^CATT^ 
CCAGCXWTCTAjCXXiS^^ 
AAATTAATTAATAAACACGTCrTAAATO^ 
CmyiGGGCTCTGAGTGCTGCXXrrG^ 
TGGTGG?TCCCCCA£mx^ 
-ACTTCCAATGGCAATGGGAATTAAGA 
AOOCAATCn^AATGTTTTTCTTTTTT 
TATCCATAATCCCACCATCTTAACAC^^ 
AATACMtTTA CTlt» rr i AOAQGTTAra 
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CAATGTCTCX^CTACC^^ 

CX^CCACCACACTTC^AATTTT^ 

AGTGATCCTTCCACCTCAGCCTCCXrCAAGGGCT^ 

GTAATAAATGAAAATT^ 

ACTGAATTTCTTTTTTGTGCACCCTA 

CCTGO^GTl-rAAAGAaw.aWACT 

TCK3GAACTTTCTTCTAGTTATCT 

TCATA(XTT<^XXXXXXTC^ 

CTCTTACCATCATCATGATCGrTGRAA 

CTGTGSACX^TTTCTTCAAATTGTTAAGAA 
CAACAGCAOGTATTCTATTACCCATCCT^ 

ATCCTQAAATJUUUUIA*^ 

TAAC^AAATCOTCTTCC^ 

TTTTGTAAAGACC^TACGTT^ 

AG™K»CAATTT^^ 

GAAACIX^ACTCGTTAAG^ 

CTCTAQCTACWK^^^ 

TCCATGTTGTTGCATGTCTC^^ 

GCACAACAGCGX^TGTCTTTTCAC^ 
«XAAC»CAATAATAOGAGAGCTATGT^^ 

ooocAAcwiAocxaaACxcr^^ 

TTAAAOAACrTITCAC3C!CAGtrCTTCl\TCT 
TA*CgCXaMAlCAACftTA^ 

TAC ^ ATTQC ^^ 

CPGCT(3AGTAQC!FGGGAC?A 

CTGGTCTCAAACTCCCAGGCXCA^ 

«K*ACTAATAG^^ 

TOACAAAAAAATAAOTOAWJUyaAGAACCT 

<X5CTCAAAACAGCTAAATG^^^ 

TTGGCCAAGATCCTTAATCTCA<n , C?^ 

1H^OTTACCCGAGAT<^CTGCGGTTTTAA^ 

CTCATCCTGKXXTIGACa^^ 

ATTOTTCTATTTTATTATTAGTTATTQT^^ 

OAAAAAAACATOOTATOTATAAOtn-TOW 

CXaAAACTACTOTAAAAGTTTOTCTTTTATAOACTAC^ 
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AAGTTTACCTACATGTTTGAATTTOT 

AAACTGAAAAAGTTGCAAGATACTT<X^ 

ATTACX^GGTCATCCCTCTTTAAAATGTACCAAAT^ 

CAAGTTTTTCTTTAACy\ATGAGAAATTTTATC^ 

TAATGTAGTATATTTTTATGCOTAAAT^^ 

GACATCAAATTCPACOTATGAAATATTTAT^ 

TATACTACAAATTTCTATOACTACT^ 

TCCAATAAGrTFAACCTATCGACTAATAAT^ 

TTTTTATAGATGCAAGCACTGTGGCACCTACTG^ 

TCTTCCCAGTATTCAGGTCl^^ 

TGTTTACrTGCTATACCCATTAATTTACTATCATCACA 

TCXXyiGAATATCAGTTCTATGAAGAAAAGCWC^ 

GQCATATGGAAAAACGCTTGACrraU^ 

TTACTATCAAGO^A^TACTCAAOGAAGGCAGCAG 

GGTATTTGGGTGAAACAAATAAACCAGCCn*CAAAATA^ 

GCTCGACMCAGGC^GATTACTTCAGGT^ 
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AGTTGCGTCCCTCTCTGTTGCCAGGCTGGAGTTCAGTGGCATGTTCATAGCTC 

ACTGAAGCCTCAAATTCNTGGGTTCAAGTGACCCTCCTACCTCAGCCCCATGA 

GGACCTGGGACTACAGTTCCCTCCCTTTGGAACGCAGCGTGGGCACCTGCAA 

CGCAGAGACCACTGTATCTCCGGTGCAGAATGTAATGAGTGCCTGATACATT 

TGCCGAATAAACTATTCCAAGGGTTGAACTTGCTGGAAGCAANAGAAGCACT 

ATTCTGGTAACAGCGGGAACATGAAGCCGCCACTCTTGGTGTTTATTGTGTGT 

CTGCTGTGGTTGAAAGACAGTCACTGCGCACCCACTTGGAAGGACAAAACTG 

CTATCAGTGAAAACCTGAAGAGTTTTTCTGA 
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agttgcgtccctctctgttgccaggctggagttcagtggcatgttcttagctc 

actgaagcctcaaattcctgggttcaagtgaccctcccacctcagccccatga 

ggacctgggactacagatggagtcttgctctcgttgcccagactggagtgca 

ctgctgcgatctcagctcactgcaacctctacctcccaggttcaagcgattct 

cctgcctcagcctctcgagtggctgggactatagtaacagcgggaacatgaa 

ck:cgccactcttggtgtttattgtgtgtccgctgtggttgaaagacagtcact 

gcgcacccacttggaaggacaaaactgctatcagtgaaaacctgaagagttt 

TTCT 
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CTTGGAGTGAACTGAOrrGTGGACTOAAA^ 7 9 

M K L 3 

CACACTC^PGACTTAACrTTATTCT^ ATG AAG CTG 153 

PLLMFPVCLLWLKDCHCAPT 23

CCA CTT TTG ATG TTT CCC GTG TGT CTG CTA TGG TTG AAA GAC TGT CAT TGT GCA CCT ACT 213

WKDKTAISENANSFSEAGEI 43

TGG AAG GAC AAA ACT GCC ATC AGT GAA AAC GCG AAC AGT TTT TCT GAG GCT GGG GAG ATA 273

DVDGEVKIALIGIKQMKIMM 63

GAC GTA GAT GGA GAG GTG AAG ATA GCT TTG ATT GGC ATT AAA CAG ATG AAA ATC ATG ATG 333

ERREEEHSKLMKTLKKCKEE 83

GAA AGG AGA GAG GAA GAA CAC AGC AAA CTA ATG AAA ACC TTG AAG AAG TGC AAA GAA GAA 393

KQEALKLMNEVHEHLEEEES 103

AAG CAG GAG GCC CTG AAA CTT ATG AAT GAA GTT CAT GAA CAC CTG GAG GAG GAA GAA AGC 453

LCQVSLADSWDECRACLESN 123

TTA TGC CAG GTT TCT CTG GCA GAT TCC TGG GAT GAA TGC AGG GCT TGC CTG GAA AGT AAC 513

CMRFDTTCQPAWSSVKNMVE 143

TGC ATG AGG TTT GAT ACC ACC TGC CAA CCT GCA TGG TCC TCT GTG AAA AAT ATG GTG GAA 573

QFFRKIYQFLFPLQENDRSG 163

CAG TTT TTC AGG AAG ATC TAT CAG TTT CTG TTT CCT CTC CAG GAA AAT GAC AGA AGT GGC 633

PVSKGVTEEDAQVSHIEHVF 183

CCT GTC AGC AAA GGG GTC ACT GAG GAA GAT GCG CAG GTG TCA CAC ATA GAG CAT GTG TTC 693

SQLSADVTSLFNRSLYVFKQ 203

AGC CAG CTG AGC GCA GAT GTG ACA TCT CTC TTC AAC AGA AGC CTT TAC GTC TTC AAA CAG 753

LRREFDQAFQSYFTSGTDVT 223

CTG CGG CGA GAA TTT GAC CAG GCT TTT CAG TCA TAT TTC ACA TCG GGG ACT GAC GTT ACA 813

EPFFFPSLSKEPAYRADAEP 243

GAG CCT TTC TTT TTT CCA TCT TTG TCC AAG GAG CCA GCC TAC AGA GCA GAT GCT GAG CCA 873

SWAIPNVFQLLCNLSFSVYQ 263

AGC TGG GCC ATT CCC AAT GTC TTC CAG CTG CTC TGC AAC TTG AGT TTC TCA GTT TAT CAA 933

SVSEKLITTLRATEDPPKQD 283

AGT GTC AGT GAA AAA CTC ATC ACA ACC CTG CGT GCC ACA GAG GAC CCT CCA AAA CAA GAC 993

KDSNQGGPISKILPEQDRGS 303

AAA GAC TCC AAC CAG GGA GGC CCC ATT TCA AAG ATA CTA CCT GAG CAA GAC AGA GGC TCA 1053

DGKLGQNLSDCVNFRKRCQK 323

GAT GGG AAA CTT GGC CAG AAT TTG TCT GAT TGC GTT AAT TTT CGC AAG AGA TGC CAG AAA 1113

CQDYLSDDCPNVPELYRELN 343

TGC CAG GAT TAT CTA TCT GAT GAC TGC CCT AAT GTG CCT GAA CTA TAC AGA GAA CTC AAT 1173

EALRLVSRSNQQYDQVVQMT 363

GAG GCC CTC CGA CTG GTC AGT AGA TCC AAT CAG CAA TAC GAC CAG GTG GTG CAG ATG ACC 1233

QYHLEDTTLLMEKMREQFGW 383

CAG TAT CAC CTG GAA GAC ACC ACG CTT CTG ATG GAG AAG ATG AGA GAG CAG TTT GGC TGG 1293

VSELAYQSPGAEDIFNPVKV 403

GTT TCT GAA CTG GCA TAC CAG TCC CCA GGA GCT GAG GAC ATC TTT AAT CCA GTG AAA GTA 1353

MVALSAHEGNSSDQDDTVVP 423

ATG GTA GCC CTA AGT GCT CAT GAA GGA AAT TCT TCT GAT CAA GAT GAC ACA GTG GTT CCT 1413

SSLLPSSNFTLSSPLEKSAG 443

TCA AGC CTC CTG CCT TCC TCT AAC TTC ACA CTC AGC AGC CCT CTT GAA AAG AGT GCT GGC 1473

NANFIDHVVEKVLQHFKEHF 463

AAC GCT AAC TTC ATT GAT CAC GTG GTA GAG AAG GTT CTT CAG CAC TTT AAG GAG CAC TTT 1533

KTW* 467

AAA ACT TGG TAA 1545

GAAGATTTAGTCC^TCCTATAATCAGCAAGAATTAC^ 1624 

TAACACAATGAACACAGCTGCATGAAAGTTAGG 1703 

AATAGCTATTCAAATTGAGTTAATATAAAA^ 1782 

TTCTTTGTATACTAAATAAATACTGAGTCCCCT XB IB 
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CTrGGAGTCAACTGAGTGTGGACTCAAA 79 

M K L 3 

C"ACACTCTGACTTAACrrTTATTCTO ATG AAG CTG 153 

PLLMFPVCLLWLKDCHCAPT 23

CCA CTT TTG ATG TTT CCC GTG TGT CTG CTA TGG TTG AAA GAC TGT CAT TGT GCA CCT ACT 213

WKDKTAISENANSFSEAGEI 43

TGG AAG GAC AAA ACT GCC ATC AGT GAA AAC GCG AAC AGT TTT TCT GAG GCT GGG GAG ATA 273

DVDGEVKIALIGIKQMKIMM 63

GAC GTA GAT GGA GAG GTG AAG ATA GCT TTG ATT GGC ATT AAA CAG ATG AAA ATC ATG ATG 333

ERREEEHSKLMKTLKKCKEE 83

GAA AGG AGA GAG GAA GAA CAC AGC AAA CTA ATG AAA ACC TTG AAG AAG TGC AAA GAA GAA 393

KQEALKLMNEVHEHLEEEES 103

AAG CAG GAG GCC CTG AAA CTT ATG AAT GAA GTT CAT GAA CAC CTG GAG GAG GAA GAA AGC 453

LCQVSLADSWDECRACLESN 123

TTA TGC CAG GTT TCT CTG GCA GAT TCC TGG GAT GAA TGC AGG GCT TGC CTG GAA AGT AAC 513

CMRFDTTCQPAWSSVKNMEN 143

TGC ATG AGG TTT GAT ACC ACC TGC CAA CCT GCA TGG TCC TCT GTG AAA AAT ATG GAA AAT 573

DRSGPVSKGVTEEDAQVSHI 163

GAC AGA AGT GGC CCT GTC AGC AAA GGG GTC ACT GAG GAA GAT GCG CAG GTG TCA CAC ATA 633

EHVFSQLSADVTSLFNRSLY 183

GAG CAT GTG TTC AGC CAG CTG AGC GCA GAT GTG ACA TCT CTC TTC AAC AGA AGC CTT TAC 693

VFKQLRREFDQAFQSYFTSG 203

GTC TTC AAA CAG CTG CGG CGA GAA TTT GAC CAG GCT TTT CAG TCA TAT TTC ACA TCG GGG 753

TDVTEPFFFPSLSKEPAYRA 223

ACT GAC GTT ACA GAG CCT TTC TTT TTT CCA TCT TTG TCC AAG GAG CCA GCC TAC AGA GCA 813

DAEPSWAIPNVFQLLCNLSF 243

GAT GCT GAG CCA AGC TGG GCC ATT CCC AAT GTC TTC CAG CTG CTC TGC AAC TTG AGT TTC 873

SVYQSVSEKLITTLRATEDP 263

TCA GTT TAT CAA AGT GTC AGT GAA AAA CTC ATC ACA ACC CTG CGT GCC ACA GAG GAC CCT 933

PKQDKDSNQGGPISKILPEQ 283

CCA AAA CAA GAC AAA GAC TCC AAC CAG GGA GGC CCG ATT TCA AAG ATA CTA CCT GAG CAA 993

DRGSDGKLGQNLSDCVNFRK 303

GAC AGA GGC TCA GAT GGG AAA CTT GGC CAG AAT TTG TCT GAT TGC GTT AAT TTT CGC AAG 1053

RCQKCQDYLSDDCPNVPELY 323

AGA TGC CAG AAA TGC CAG GAT TAT CTA TCT GAT GAC TGC CCT AAT GTG CCT GAA CTA TAC 1113

RELNEALRLVSRSNQQYDQV 343

AGA GAA CTC AAT GAG GCC CTC CGA CTG GTC AGT AGA TCC AAT CAG CAA TAC GAC CAG GTG 1173

VQMTQYHLEDTTLLMEKMRE 363

GTG CAG ATG ACC CAG TAT CAC CTG GAA GAC ACC ACG CTT CTG ATG GAG AAG ATG AGA GAG 1233

QFGWVSELAYQSPGAEDIFN 383

CAG TTT GGC TGG GTT TCT GAA CTG GCA TAC CAG TCC CCA GGA GCT GAG GAC ATC TTT AAT 1293

PVKVMVALSAHEGNSSDQDD 403

CCA GTG AAA GTA ATG GTA GCC CTA AGT GCT CAT GAA GGA AAT TCT TCT GAT CAA GAT GAC 1353



FIG. 8A 



WO 99/47535 



32/57 



PCT/US99/05606 



TVVPSSLLPSSNFTLSSPLE 423

ACA GTG GTT CCT TCA AGC CTC CTG CCT TCC TCT AAC TTC ACA CTC AGC AGC CCT CTT GAA 1413

KSAGNANFIDHVVEKVLQHF 443

AAG AGT GCT GGC AAC GCT AAC TTC ATT GAT CAC GTG GTA GAG AAG GTT CTT CAG CAC TTT 1473

KEHFKTW* 451

AAG GAG CAC TTT AAA ACT TGG TAA 1497

GAAGATTTAaTCCATCCrATAATCAGCAAGAATTACACCT 1S7S 

TAACACAATGAACmCAGCTGCATGAAAGTTAGGTATATAT^^ 1 6 5 S 

AATAGCTATTCAAATTGAGTTAATATAAAAAT^ 1734 

TTCTTTGTATACTAAATAAATACTGAGTCCCCT 1767 
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CTTCGAGTCAACTGAGTGTC^ACTGAAACTTCCAAAAACIXS 1 9 

M K L 3 

CACACTCTGACTTAACTTTATTCTGTGG^ ATO AAG CTQ 153 

PLLMFPVCLLWLKDCHCAPT 23

CCA CTT TTG ATG TTT CCC GTG TGT CTG CTA TGG TTG AAA GAC TGT CAT TGT GCA CCT ACT 213

WKDKTAISENANSFSEAGEI 43

TGG AAG GAC AAA ACT GCC ATC AGT GAA AAC GCG AAC AGT TTT TCT GAG GCT GGG GAG ATA 273

DVDGEVKIALIGIKQMKIMM 63

GAC GTA GAT GGA GAG GTG AAG ATA GCT TTG ATT GGC ATT AAA CAG ATG AAA ATC ATG ATG 333

ERREEEHSKLMKTLKKCKEE 83

GAA AGG AGA GAG GAA GAA CAC AGC AAA CTA ATG AAA ACC TTG AAG AAG TGC AAA GAA GAA 393

KQEALKLMNEVHEHLEEEES 103

AAG CAG GAG GCC CTG AAA CTT ATG AAT GAA GTT CAT GAA CAC CTG GAG GAG GAA GAA AGC 453

LCQVSLADSWDECRACLESN 123

TTA TGC CAG GTT TCT CTG GCA GAT TCC TGG GAT GAA TGC AGG GCT TGC CTG GAA AGT AAC 513

CMRFDTTCQPAWSSVKNMEP 143

TGC ATG AGG TTT GAT ACC ACC TGC CAA CCT GCA TGG TCC TCT GTG AAA AAT ATG GAG CCA 573

AYRADAEPSWAIPNVFQLLC 163

GCC TAC AGA GCA GAT GCT GAG CCA AGC TGG GCC ATT CCC AAT GTC TTC CAG CTG CTC TGC 633

NLSFSVYQSVSEKLITTLRA 183

AAC TTG AGT TTC TCA GTT TAT CAA AGT GTC AGT GAA AAA CTC ATC ACA ACC CTG CGT GCC 693

TEDPPKQDKDSNQGGPISKI 203

ACA GAG GAC CCT CCA AAA CAA GAC AAA GAC TCC AAC CAG GGA GGC CCG ATT TCA AAG ATA 753

LPEQDRGSDGKLGQNLSDCV 223

CTA CCT GAG CAA GAC AGA GGC TCA GAT GGG AAA CTT GGC CAG AAT TTG TCT GAT TGC GTT 813

NFRKRCQKCQDYLSDDCPNV 243

AAT TTT CGC AAG AGA TGC CAG AAA TGC CAG GAT TAT CTA TCT GAT GAC TGC CCT AAT GTG 873

PELYRELNEALRLVSRSNQQ 263

CCT GAA CTA TAC AGA GAA CTC AAT GAG GCC CTC CGA CTG GTC AGT AGA TCC AAT CAG CAA 933

YDQVVQMTQYHLEDTTLLME 283

TAC GAC CAG GTG GTG CAG ATG ACC CAG TAT CAC CTG GAA GAC ACC ACG CTT CTG ATG GAG 993

KMREQFGWVSELAYQSPGAE 303

AAG ATG AGA GAG CAG TTT GGC TGG GTT TCT GAA CTG GCA TAC CAG TCC CCA GGA GCT GAG 1053

DIFNPVKVMVALSAHEGNSS 323

GAC ATC TTT AAT CCA GTG AAA GTA ATG GTA GCC CTA AGT GCT CAT GAA GGA AAT TCT TCT 1113

DQDDTVVPSSLLPSSNFTLS 343

GAT CAA GAT GAC ACA GTG GTT CCT TCA AGC CTC CTG CCT TCC TCT AAC TTC ACA CTC AGC 1173

SPLEKSAGNANFIDHVVEKV 363

AGC CCT CTT GAA AAG AGT GCT GGC AAC GCT AAC TTC ATT GAT CAC GTG GTA GAG AAG GTT 1233

LQHFKEHFKTW* 375

CTT CAG CAC TTT AAG GAG CAC TTT AAA ACT TGG TAA 1269

GAAGATTTAGTCCATCCTATAATCAGCAAGA^ 134 8 
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TAACACAATGAACACAGCTGCATGLkAAGTTAGGTAT^ 1427 
AATAG CTATTCAAATTG AGTTAAT ATAAA^TTT CTTCCT AAAAAG T AAAATG T ACATATGTAG AAT ATG ATGC ATT AG 1506 
TTCTTTGTATACTAAATAAATACTGAGTCCCCT 153 9 
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CTTGGAGTCAACTGAGTGTGGACTGAAA 79 

M K L 3 

CACACTCTGACTTAACTTTATTCTCTGGAC^ ATG AAG CTG 153 

PLLMFPVCLLWLKDCHCAPT 23

CCA CTT TTG ATG TTT CCC GTG TGT CTG CTA TGG TTG AAA GAC TGT CAT TGT GCA CCT ACT 213

WKDKTAISENANSFSEAGEI 43

TGG AAG GAC AAA ACT GCC ATC AGT GAA AAC GCG AAC AGT TTT TCT GAG GCT GGG GAG ATA 273

DVDGEVKIALIGIKQMKIMM 63

GAC GTA GAT GGA GAG GTG AAG ATA GCT TTG ATT GGC ATT AAA CAG ATG AAA ATC ATG ATG 333

ERREEEHSKLMKTLKKCKEE 83

GAA AGG AGA GAG GAA GAA CAC AGC AAA CTA ATG AAA ACC TTG AAG AAG TGC AAA GAA GAA 393

KQEALKLMNEVHEHLEEEES 103

AAG CAG GAG GCC CTG AAA CTT ATG AAT GAA GTT CAT GAA CAC CTG GAG GAG GAA GAA AGC 453

LCQVSLADSWDECRACLESN 123

TTA TGC CAG GTT TCT CTG GCA GAT TCC TGG GAT GAA TGC AGG GCT TGC CTG GAA AGT AAC 513

CMRFDTTCQPAWSSVKNMPA 143

TGC ATG AGG TTT GAT ACC ACC TGC CAA CCT GCA TGG TCC TCT GTG AAA AAT ATG CCA GCC 573

YRADAEPSWAIPNVFQLLCN 163

TAC AGA GCA GAT GCT GAG CCA AGC TGG GCC ATT CCC AAT GTC TTC CAG CTG CTC TGC AAC 633

LSFSVYQSVSEKLITTLRAT 183

TTG AGT TTC TCA GTT TAT CAA AGT GTC AGT GAA AAA CTC ATC ACA ACC CTG CGT GCC ACA 693

EDPPKQDKDSNQGGPISKIL 203

GAG GAC CCT CCA AAA CAA GAC AAA GAC TCC AAC CAG GGA GGC CCG ATT TCA AAG ATA CTA 753

PEQDRGSDGKLGQNLSDCVN 223

CCT GAG CAA GAC AGA GGC TCA GAT GGG AAA CTT GGC CAG AAT TTG TCT GAT TGC GTT AAT 813

FRKRCQKCQDYLSDDCPNVP 243

TTT CGC AAG AGA TGC CAG AAA TGC CAG GAT TAT CTA TCT GAT GAC TGC CCT AAT GTG CCT 873

ELYRELNEALRLVSRSNQQY 263

GAA CTA TAC AGA GAA CTC AAT GAG GCC CTC CGA CTG GTC AGT AGA TCC AAT CAG CAA TAC 933

DQVVQMTQYHLEDTTLLMEK 283

GAC CAG GTG GTG CAG ATG ACC CAG TAT CAC CTG GAA GAC ACC ACG CTT CTG ATG GAG AAG 993

MREQFGWVSELAYQSPGAED 303

ATG AGA GAG CAG TTT GGC TGG GTT TCT GAA CTG GCA TAC CAG TCC CCA GGA GCT GAG GAC 1053

IFNPVKVMVALSAHEGNSSD 323

ATC TTT AAT CCA GTG AAA GTA ATG GTA GCC CTA AGT GCT CAT GAA GGA AAT TCT TCT GAT 1113

QDDTVVPSSLLPSSNFTLSS 343

CAA GAT GAC ACA GTG GTT CCT TCA AGC CTC CTG CCT TCC TCT AAC TTC ACA CTC AGC AGC 1173

PLEKSAGNANFIDHVVEKVL 363

CCT CTT GAA AAG AGT GCT GGC AAC GCT AAC TTC ATT GAT CAC GTG GTA GAG AAG GTT CTT 1233

QHFKEHFKTW* 374

CAG CAC TTT AAG GAG CAC TTT AAA ACT TGG TAA 1266

GAAGATTTAOTCCATCCTATAATCAGCAAG 1345 
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TAACACAATG AACACAGCTG CATC CACT ATTGG TTTACTTTGTTG AATGGAAG TTT 1424 
AATAGCTATTCAAATTGAGTTAATATAAAAATTTCTTCCTA^ 1503 
TTCTTTGTATACTAAATAAATACTGAGTCCCCT 153 6 
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GCAACCTCGTTGOTGAGAGCCTGCAGTTAOTGTCACG<3CGGAAAC ATG AAG CCG CCA CTC TTG GTG TTT 69 

IVYLLRLRDCQCAPTGKDRT 28

ATT GTG TAT CTG CTG CGG CTG AGA GAC TGT CAG TGT GCG CCT ACA GGG AAG GAC CGA ACT 129

SIREDPKGFSKAGEIDVDEE 48

TCC ATC CGT GAA GAC CCG AAG GGT TTT TCC AAG GCT GGG GAG ATA GAC GTA GAT GAA GAG 189

VKKALIGMKQMKILMERREE 68

GTG AAG AAG GCT TTG ATT GGC ATG AAG CAG ATG AAA ATC CTG ATG GAA AGA AGA GAG GAG 249

EHSKLMRTLKKCREEKQEAL 88

GAA CAT AGC AAA CTA ATG AGA ACA CTG AAG AAA TGC AGA GAA GAA AAG CAG GAG GCC CTG 309

KLMNEVQEHLEEEERLCQVS 108

AAG CTT ATG AAT GAA GTT CAA GAA CAT CTA GAA GAG GAA GAA AGG CTA TGC CAG GTG TCT 369

LMGSWDECKSCLESDCMRFY 128

CTG ATG GGT TCC TGG GAC GAA TGC AAA TCT TGC CTG GAA AGT GAC TGC ATG AGA TTT TAT 429

TTCQSSWSSMKSTIERVFRK 148

ACA ACC TGC CAA AGC AGT TGG TCC TCT ATG AAA TCC ACG ATT GAA CGG GTT TTC CGG AAG 489

IYQFLFPFHEDDEKELPVGE 168

ATA TAT CAG TTT CTC TTT CCT TTC CAT GAA GAC GAT GAA AAA GAG CTT CCT GTT GGT GAG 549

KFTEEDVQLMQIENVFSQLT 188

AAG TTC ACT GAG GAA GAT GTA CAG CTG ATG CAG ATA GAG AAT GTG TTC AGC CAG CTG ACC 609

VDVGFLYNMSFHVFKQMQQE 208

GTG GAT GTG GGA TTT CTC TAT AAC ATG AGC TTT CAC GTC TTC AAA CAG ATG CAG CAA GAA 669

FDLAFQSYFMSDTDSMEPYF 228

TTT GAC CTG GCT TTT CAA TCA TAC TTT ATG TCA GAC ACA GAC TCC ATG GAG CCT TAC TTT 729

FPAFSKEPAKKAHPMQSWDI 248

TTT CCA GCT TTT TCC AAA GAG CCA GCA AAA AAA GCA CAT CCT ATG CAG AGT TGG GAC ATT 789

PSFFQLFCNFSLSVYQSVSA 268

CCC AGC TTC TTC CAG CTG TTT TGT AAT TTC AGC CTC TCT GTT TAT CAA AGT GTC AGC GCA 849

TVTEMLKAIEDLSKQDKDSA 288

ACA GTT ACA GAG ATG CTG AAG GCC ATT GAG GAC TTA TCC AAA CAA GAC AAA GAT TCT GCC 909

HGGPSSTTWPVRGRGLCGEP 308

CAC GGT GGA CCG AGT TCC ACG ACG TGG CCT GTG CGG GGC AGA GGG CTG TGT GGA GAA CCT 969

GQNSSECLQFHARCQKCQDY 328

GGC CAG AAC TCG TCC GAA TGT CTC CAA TTT CAT GCA AGA TGC CAG AAA TGT CAG GAT TAC 1029

LWADCPAVPELYTKADEALE 348

CTA TGG GCA GAC TGC CCT GCT GTT CCT GAA CTA TAC ACA AAG GCG GAT GAG GCC CTT GAG 1089

LVNISNQQYAQVLQMTQHHL 368

TTG GTC AAC ATA TCC AAT CAG CAG TAT GCC CAG GTA CTC CAG ATG ACC CAG CAT CAC TTG 1149

EDTTYLMEKMREQFGWVTEL 388

GAG GAC ACC ACG TAT CTG ATG GAG AAG ATG AGA GAG CAG TTT GGT TGG GTA ACA GAG CTG 1209

ASQTPGSENIFSFIKVVPGV 408

GCC AGC CAG ACC CCA GGA AGC GAG AAC ATC TTC AGT TTC ATA AAG GTA GTT CCA GGT GTT 1269

HEGNFSKQDEKMIDISILPS 428

CAC GAA GGA AAT TTC TCC AAA CAA GAT GAA AAG ATG ATA GAC ATA AGC ATT CTG CCT TCC 1329

SNFTLTIPLEESAESSDFIS 448

TCT AAT TTC ACA CTC ACC ATC CCT CTT GAA GAA AGT GCT GAG AGT TCC GAC TTC ATT AGC 1389



G GAG AGT ATTTG ATTAGGG ACGTTTGCTG ATAGG AATAG ATGG TT CTTAAAAGGG AAAAATGACAAAACTAG CTTTTG A 1522 

ATACCTTG AAAACTGTATTC!AACCTCAT^AATAATCAAA^ C ATG AAAACTAAG ACAAGTT AGCAGT Tf TT ACCT ATTG A 16 0 1 

ATTTTCAAATTAAAAAAAAAAATCCT^ATAGA^ 1680 

GTTTTGTCTTTTTG AAAAGTTATTCAATTA^ 1759 

TGGAATCAATCCZAAAX3GAGTAAATCTAAAATTGAATTGAAGTT 183 8 

TAGTAAACTGTTAAAAA<n^AATGTCATCTGAATGTCTAA 1911 

AAATATCTTATAAAACCATTAAAAATATTTATA 1936 

AAGCCTATCTATAAGGCAAATATTATTAT^^ 20 75 

AATACnWKXSCAGGTTGCrrATTTCCrTCTCCAGGAAA 2233 

GGTAGATGCITTACn'ATCrGAGCAACCA 2312 

CTCTGAATCATAATTGCCAC7VTTAAACTG 23 91 

AAACTCrrATAAATGTAATGATCAAAACGAAAAAAAATCTACAATCTGCATTAA 24 64 



YMLAKAVQHFKEHFKSW* 466

TAC ATG CTG GCC AAA GCT GTA CAG CAT TTT AAG GAA CAT TTT AAA TCT TGG TAA 1443
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CAGAAGCTGKSTGGCAACCTCGTTGGTGAGAGCCTGC^ ATG AAG CCG CCA CTC 72 

LVFIVYLLRLRDCQCAPTGK 25

TTG GTG TTT ATT GTG TAT CTG CTG CGG CTG AGA GAC TGT CAG TGT GCG CCT ACA GGG AAG 132

DRTSIREDPKGFSKAGEIDV 45

GAC CGA ACT TCC ATC CGT GAA GAC CCG AAG GGT TTT TCC AAG GCT GGG GAG ATA GAC GTA 192

DEEVKKALIGMKQMKILMER 65

GAT GAA GAG GTG AAG AAG GCT TTG ATT GGC ATG AAG CAG ATG AAA ATC CTG ATG GAA AGA 252

REEEHSKLMRTLKKCREEKQ 85

AGA GAG GAG GAA CAT AGC AAA CTA ATG AGA ACA CTG AAG AAA TGC AGA GAA GAA AAG CAG 312

EALKLMNEVQEHLEEEERLC 105

GAG GCC CTG AAG CTT ATG AAT GAA GTT CAA GAA CAT CTA GAA GAG GAA GAA AGG CTA TGC 372

QVSLMGSWDECKSCLESDCM 125

CAG GTG TCT CTG ATG GGT TCC TGG GAC GAA TGC AAA TCT TGC CTG GAA AGT GAC TGC ATG 432

RFYTTCQSSWSSMKSTIERV 145

AGA TTT TAT ACA ACC TGC CAA AGC AGT TGG TCC TCT ATG AAA TCC ACG ATT GAA CGG GTT 492

FRKIYQFLFPFHEDDEKELP 165

TTC CGG AAG ATA TAT CAG TTT CTC TTT CCT TTC CAT GAA GAC GAT GAA AAA GAG CTT CCT 552

VGEKFTEEDVQLMQIENVFS 185

GTT GGT GAG AAG TTC ACT GAG GAA GAT GTA CAG CTG ATG CAG ATA GAG AAT GTG TTC AGC 612

QLTVDVGFLYNMSFHVFKQM 205

CAG CTG ACC GTG GAT GTG GGA TTT CTC TAT AAC ATG AGC TTT CAC GTC TTC AAA CAG ATG 672

QQEFDLAFQSYFMSDTDSME 225

CAG CAA GAA TTT GAC CTG GCT TTT CAA TCA TAC TTT ATG TCA GAC ACA GAC TCC ATG GAG 732

PYFFPAFSKEPAKKAHPMQS 245

CCT TAC TTT TTT CCA GCT TTT TCC AAA GAG CCA GCA AAA AAA GCA CAT CCT ATG CAG AGT 792

WDIPSFFQLFCNFSLSVYQS 265

TGG GAC ATT CCC AGC TTC TTC CAG CTG TTT TGT AAT TTC AGC CTC TCT GTT TAT CAA AGT 852

VSATVTEMLKAIEDLSKQDK 285

GTC AGC GCA ACA GTT ACA GAG ATG CTG AAG GCC ATT GAG GAC TTA TCC AAA CAA GAC AAA 912

DSAHGGPSSTTWPVRGRGLC 305

GAT TCT GCC CAC GGT GGA CCG AGT TCC ACG ACG TGG CCT GTG CGG GGC AGA GGG CTG TGT 972

GEPGQNSSECLQFHARCQKC 325

GGA GAA CCT GGC CAG AAC TCG TCC GAA TGT CTC CAA TTT CAT GCA AGA TGC CAG AAA TGT 1032

QDYLWADCPAVPELYTKADE 345

CAG GAT TAC CTA TGG GCA GAC TGC CCT GCT GTT CCT GAA CTA TAC ACA AAG GCG GAT GAG 1092

ALELVNISNQQYAQVLQMTQ 365

GCC CTT GAG TTG GTC AAC ATA TCC AAT CAG CAG TAT GCC CAG GTA CTC CAG ATG ACC CAG 1152

HHLEDTTYLMEKMREQFGWV 385

CAT CAC TTG GAG GAC ACC ACG TAT CTG ATG GAG AAG ATG AGA GAG CAG TTT GGT TGG GTA 1212

TELASQTPGSENIFSFIKVV 405

ACA GAG CTG GCC AGC CAG ACC CCA GGA AGC GAG AAC ATC TTC AGT TTC ATA AAG GTA GTT 1272

PGVHEGNFSKQDEKMIDISI 425

CCA GGT GTT CAC GAA GGA AAT TTC TCC AAA CAA GAT GAA AAG ATG ATA GAC ATA AGC ATT 1332

LPSSNFTLTIPLEESAESSD 445

CTG CCT TCC TCT AAT TTC ACA CTC ACC ATC CCT CTT GAA GAA AGT GCT GAG AGT TCC GAC 1392

FISYMLAKAVQHFKEHFKSW 465

TTC ATT AGC TAC ATG CTG GCC AAA GCT GTA CAG CAT TTT AAG GAA CAT TTT AAA TCT TGG 1452

* 466

TAA 1455

GCAGAGTATTTGATTAGGGACGTTTGCTG ATAGGA^^ 1534 

' ATACCrTGAAAACGTATTCAACCTCATTAATAATC 16 13 

ATTTTCAAATTAAAAAAAAAAATCCTGATAGAATGCA^ 1692 

GrTTTOTCTTTTTGAAAAGTTArraVATTATA 1771 

TGGAATCAATCCAAAGGAGTAAATCTAAAATTGAATTGA^ 1850 

TAGTAAACTGTTAAAAACl^AATGTCATCTGAAT^ 1929 

AAATATCITATAAAACCATTAAAAATATTTATAAAATTTA 2 008 

AAG CCTATCTATAAGG CAAATATTATTATTACTAT CTTCCAGAAAAG AAACTTG 2087 

CTCAGTCATaTCTGACTCTTTGAGACCCCTTGGACTC 2166 

AATACTGGAGCJAGGTTGCTATTTCCTTCTCCA 2245 

GGTAGATGCTTTACTATCTGAGCAACCAAATG^ 2324 

CTCTGAATCATAATTGCCACATTAAACTGGTTC 2403 

AAACTCTTATAAATGTAATGATCAAAACX3AAAAAAAATCT 2482 

&&SZGX&MXtGX3J^^ 2561 

GATTTCAAATTAGAAGATATGTTGCTAAAATAGCTAGGTAAATG^ 2640 

AAACFTTAGTATAAGTACTTCrAOT 2719 

ACAACTAAATGACATTTCAGACX^ACATT^ 2798 

AOlTACrrATTCTCCrTTGGTAAAATGOT 2877 

CVriXiTAGGTAGGTCCCTATCAATGTATAATTAAGCT 295 6 

TTOaAGAGGTTGTTGGTCATCAATCAAOT 3016 
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GTGAAGGTCCTTACAGAAGCTGGTGGCAACCTCGTTGGTC ATG AAG 76 

PPILVFIVYLLQLRDCQCAP 22

CCG CCA ATC TTG GTG TTT ATC GTG TAT CTG CTG CAG CTG AGA GAC TGT CAG TGT GCG CCT 136

TGKDRTSIREDPKGFSKAGE 42

ACA GGG AAG GAC CGA ACT TCC ATC CGT GAA GAC CCG AAG GGT TTT TCC AAG GCT GGG GAG 196

IDVDEEVKKALIGMKQMKIL 62

ATA GAC GTA GAT GAA GAG GTG AAG AAG GCT TTG ATT GGC ATG AAG CAG ATG AAA ATC CTG 256

MERREEEHSKLMRTLKKCRE 82

ATG GAA AGA AGA GAG GAG GAA CAT AGC AAA CTA ATG AGA ACC CTG AAG AAA TGC AGA GAA 316

EKQEALKLMNEVQEHLEEEE 102

GAA AAG CAG GAG GCC CTG AAG CTT ATG AAT GAA GTT CAA GAA CAT CTA GAA GAG GAA GAA 376

RLCQVSLMGSWDECKSCLES 122

AGG CTA TGC CAG GTG TCT CTG ATG GGT TCC TGG GAC GAA TGC AAA TCT TGC CTG GAA AGT 436

DCMRFYTTCQSSWSSMKSTI 142

GAC TGC ATG AGA TTT TAT ACA ACC TGC CAA AGC AGT TGG TCC TCT ATG AAA TCC ACG ATT 496

ERVFRKIYQFLFPFHEDDEK 162

GAA CGG GTT TTC CGG AAG ATA TAT CAG TTT CTC TTT CCT TTC CAT GAA GAC GAT GAA AAA 556

ELPVGEKFTEEDVQLMQIEN 182

GAG CTT CCT GTT GGT GAG AAG TTC ACT GAG GAA GAT GTA CAG CTG ATG CAG ATA GAG AAT 616

VFSQLTVDVGFLYNMSFHVF 202

GTG TTC AGC CAG CTG ACC GTG GAC GTG GGA TTT CTC TAT AAC ATG AGC TTT CAC GTC TTC 676

KQMQQEFDLAFQSYFMSDTD 222

AAA CAG ATG CAG CAA GAA TTT GAC CTG GCT TTT CAA TCA TAC TTT ATG TCA GAC ACA GAC 736

SMEPYFFPAFSKEPAKKAHP 242

TCC ATG GAG CCT TAC TTT TTT CCA GCT TTT TCC AAA GAG CCA GCA AAA AAA GCA CAT CCT 796

MQSWDIPSFFQLFCNFSLSV 262

ATG CAG AGT TGG GAC ATT CCC AGC TTC TTC CAG CTG TTT TGT AAT TTC AGC CTC TCT GTT 856

YQSVSATVTEMLKAIEDLSK 282

TAT CAA AGT GTC AGC GCA ACA GTT ACA GAG ATG CTG AAG GCC ATT GAG GAC TTA TCC AAA 916

QDKDSAHGGPSSTTWPVRGR 302

CAA GAC AAA GAT TCT GCC CAC GGT GGA CCG AGT TCC ACG ACG TGG CCT GTG CGG GGC AGA 976

GLCGEPGQNSSECLQFHARC 322

GGG CTG TGT GGA GAA CCT GGC CAG AAC TCG TCC GAA TGT CTC CAA TTT CAT GCA AGA TGC 1036

QKCQDYLWADCPAVPELYTK 342

CAG AAA TGT CAG GAT TAC CTA TGG GCA GAC TGC CCT GCT GTT CCT GAA CTA TAC ACA AAG 1096

ADEALELVNISNQQYAQVLQ 362

GCG GAT GAG GCC CTT GAG TTG GTC AAC ATA TCC AAT CAG CAG TAT GCC CAG GTA CTC CAG 1156

MTQHHLEDTTYLMEKMREQF 382

ATG ACC CAG CAT CAC TTG GAG GAC ACC ACG TAT CTG ATG GAG AAG ATG AGA GAG CAG TTT 1216

GWVTELASQTPGSENIFSFI 402

GGT TGG GTA ACA GAG CTG GCC AGC CAG ACC CCA GGA AGC GAG AAC ATC TTC AGT TTC ATA 1276

KVVPGVHEGNFSKQDEKMID 422

AAG GTA GTT CCA GGT GTT CAC GAA GGA AAT TTC TCC AAA CAA GAT GAA AAG ATG ATA GAC 1336

ISILPSSNFTLTIPLEESAE 442

ATA AGC ATT CTG CCT TCC TCT AAT TTC ACA CTC ACC ATC CCT CTT GAA GAA AGT GCT GAG 1396

SSDFISYMLAKAVQHFKEHF 462

AGT TCC GAC TTC ATT AGC TAC ATG CTG GCC AAA GCT GTA CAG CAT TTT AAG GAA CAT TTT 1456

KSW* 466

AAA TCT TGG TAA 1468

GCAGAGTATTTGATTAGGGACGTTTGCTGATA 1 54 7 

ATACCTTGAAAACXJTATTCAACCTC^TTAATAA 1626 

ATTTTCAAATTAAAAAAAAAATCXTTGATAGAATGCAATGA 1705 

TTTTGTC1T1 H 1UX?AAAAGTTATTG^ T TC T 1 1" T X AATATAATAATTCCACTTCT 1784 

GGAATOATCCAAAGK^GTAAATCTAAAATTGA^ 1863 

AGTAAACItnTAAAAACrGAAXtn*CATCT^ CTCCA 1942 

AATATCTTATAAAACXaTTAAAAATATTTA^ 2021 

AGCCTATCTATAACX3CAAATATTATTATT 2100 

TCAGTCATGTCTGACTCTTTGAGACCCCTTGGACTG^ 2 179 



GTAGATGCTTTACTATCnXlAGCAACCAAATGAATTACT 2337 
TCXtSAATCATAATTGCXACATTAAACT^ 241$ 




2258 



AACTCTATAAATGTAATGATCUIAAACX3AAAAAAAATCT 



2488 
CTTGGAGTCAACTGAGTGTGGACTGAAACTTCCAAAAACT Majority
10 20 30 40
1 CTTGGAGTCAACTGAGTGTGGACTGAAACTTCCAAAAACT gphkngl815-l.
1 CTTGGAGTCAACTGAGTGTGGACTGAAACTTCCAAAAACT gp7b-l.
1 CTTGGAGTCAACTGAGTGTGGACTGAAACTTCCAAAAACT gp7c-l.
1 CTTGGAGTCAACTGAGTGTGGACTGAAACTTCCAAAAACT gp7d-l.

GACATGAGGAGTCACTGGAGAATCATGATCAAGGAGCTAC Majority
50 60 70 80
41 GACATGAGGAGTCACTGGAGAATCATGATCAAGGAGCTAC gphkngl815-l.
41 GACATGAGGAGTCACTGGAGAATCATGATCAAGGAGCTAC gp7b-l.
41 GACATGAGGAGTCACTGGAGAATCATGATCAAGGAGCTAC gp7c-l.
41 GACATGAGGAGTCACTGGAGAATCATGATCAAGGAGCTAC gp7d-l.

ACACTCTGACTTAACTTTATTCTGTGGACAATGAGAGACA Majority
90 100 110 120
81 ACACTCTGACTTAACTTTATTCTGTGGACAATGAGAGACA gphkngl815-l.
81 ACACTCTGACTTAACTTTATTCTGTGGACAATGAGAGACA gp7b-l.
81 ACACTCTGACTTAACTTTATTCTGTGGACAATGAGAGACA gp7c-l.
81 ACACTCTGACTTAACTTTATTCTGTGGACAATGAGAGACA gp7d-l.

ACTGCAAGGATTAACAGTGAGAACATGAAGCTGCCACTTT Majority
130 140 150 160
121 ACTGCAAGGATTAACAGTGAGAACATGAAGCTGCCACTTT gphkngl815-l.
121 ACTGCAAGGATTAACAGTGAGAACATGAAGCTGCCACTTT gp7b-l.
121 ACTGCAAGGATTAACAGTGAGAACATGAAGCTGCCACTTT gp7c-l.
121 ACTGCAAGGATTAACAGTGAGAACATGAAGCTGCCACTTT gp7d-l.

TGATGTTTCCCGTGTGTCTGCTATGGTTGAAAGACTGTCA Majority
170 180 190 200
161 TGATGTTTCCCGTGTGTCTGCTATGGTTGAAAGACTGTCA gphkngl815-l.
161 TGATGTTTCCCGTGTGTCTGCTATGGTTGAAAGACTGTCA gp7b-l.
161 TGATGTTTCCCGTGTGTCTGCTATGGTTGAAAGACTGTCA gp7c-l.
161 TGATGTTTCCCGTGTGTCTGCTATGGTTGAAAGACTGTCA gp7d-l.

TTGTGCACCTACTTGGAAGGACAAAACTGCCATCAGTGAA Majority
210 220 230 240
201 TTGTGCACCTACTTGGAAGGACAAAACTGCCATCAGTGAA gphkngl815-l.
201 TTGTGCACCTACTTGGAAGGACAAAACTGCCATCAGTGAA gp7b-l.
201 TTGTGCACCTACTTGGAAGGACAAAACTGCCATCAGTGAA gp7c-l.
201 TTGTGCACCTACTTGGAAGGACAAAACTGCCATCAGTGAA gp7d-l.

AACGCGAACAGTTTTTCTGAGGCTGGGGAGATAGACGTAG Majority
250 260 270 280
241 AACGCGAACAGTTTTTCTGAGGCTGGGGAGATAGACGTAG gphkngl815-l.
241 AACGCGAACAGTTTTTCTGAGGCTGGGGAGATAGACGTAG gp7b-l.
241 AACGCGAACAGTTTTTCTGAGGCTGGGGAGATAGACGTAG gp7c-l.
241 AACGCGAACAGTTTTTCTGAGGCTGGGGAGATAGACGTAG gp7d-l.



FIG. 14A 






ATGGAGAGGTGAAGATAGCTTTGATTGGCATTAAACAGAT Majority
290 300 310 320
281 ATGGAGAGGTGAAGATAGCTTTGATTGGCATTAAACAGAT gphkngl815-l.
281 ATGGAGAGGTGAAGATAGCTTTGATTGGCATTAAACAGAT gp7b-l.
281 ATGGAGAGGTGAAGATAGCTTTGATTGGCATTAAACAGAT gp7c-l.
281 ATGGAGAGGTGAAGATAGCTTTGATTGGCATTAAACAGAT gp7d-l.

GAAAATCATGATGGAAAGGAGAGAGGAAGAACACAGCAAA Majority
330 340 350 360
321 GAAAATCATGATGGAAAGGAGAGAGGAAGAACACAGCAAA gphkngl815-l.
321 GAAAATCATGATGGAAAGGAGAGAGGAAGAACACAGCAAA gp7b-l.
321 GAAAATCATGATGGAAAGGAGAGAGGAAGAACACAGCAAA gp7c-l.
321 GAAAATCATGATGGAAAGGAGAGAGGAAGAACACAGCAAA gp7d-l.

CTAATGAAAACCTTGAAGAAGTGCAAAGAAGAAAAGCAGG Majority
370 380 390 400
361 CTAATGAAAACCTTGAAGAAGTGCAAAGAAGAAAAGCAGG gphkngl815-l.
361 CTAATGAAAACCTTGAAGAAGTGCAAAGAAGAAAAGCAGG gp7b-l.
361 CTAATGAAAACCTTGAAGAAGTGCAAAGAAGAAAAGCAGG gp7c-l.
361 CTAATGAAAACCTTGAAGAAGTGCAAAGAAGAAAAGCAGG gp7d-l.

AGGCCCTGAAACTTATGAATGAAGTTCATGAACACCTGGA Majority
410 420 430 440
401 AGGCCCTGAAACTTATGAATGAAGTTCATGAACACCTGGA gphkngl815-l.
401 AGGCCCTGAAACTTATGAATGAAGTTCATGAACACCTGGA gp7b-l.
401 AGGCCCTGAAACTTATGAATGAAGTTCATGAACACCTGGA gp7c-l.
401 AGGCCCTGAAACTTATGAATGAAGTTCATGAACACCTGGA gp7d-l.

GGAGGAAGAAAGCTTATGCCAGGTTTCTCTGGCAGATTCC Majority
450 460 470 480
441 GGAGGAAGAAAGCTTATGCCAGGTTTCTCTGGCAGATTCC gphkngl815-l.
441 GGAGGAAGAAAGCTTATGCCAGGTTTCTCTGGCAGATTCC gp7b-l.
441 GGAGGAAGAAAGCTTATGCCAGGTTTCTCTGGCAGATTCC gp7c-l.
441 GGAGGAAGAAAGCTTATGCCAGGTTTCTCTGGCAGATTCC gp7d-l.

TGGGATGAATGCAGGGCTTGCCTGGAAAGTAACTGCATGA Majority
490 500 510 520
481 TGGGATGAATGCAGGGCTTGCCTGGAAAGTAACTGCATGA gphkngl815-l.
481 TGGGATGAATGCAGGGCTTGCCTGGAAAGTAACTGCATGA gp7b-l.
481 TGGGATGAATGCAGGGCTTGCCTGGAAAGTAACTGCATGA gp7c-l.
481 TGGGATGAATGCAGGGCTTGCCTGGAAAGTAACTGCATGA gp7d-l.

GGTTTGATACCACCTGCCAACCTGCATGGTCCTCTGTGAA Majority

530 540 550 560

521 GGTTTGATACCACCTGCCAACCTGCATGGTCCTCTGTGAA gphkng1815-1
521 GGTTTGATACCACCTGCCAACCTGCATGGTCCTCTGTGAA gp7b-1
521 GGTTTGATACCACCTGCCAACCTGCATGGTCCTCTGTGAA gp7c-1
521 GGTTTGATACCACCTGCCAACCTGCATGGTCCTCTGTGAA gp7d-1
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AAATATGG 



Majority

570 580 590 600



561 A A A T A T Q G IT OOAACAGTTTTTCAGC.AAGAT C T A TC AGTTtI gphknglSIS* 

561 AAATATGG gp7b-l, 

561 AAATATGG gp7c-l! 

561 A A AT AT G0 gp7d-l. 



601 
569 
569 
568 



569 
568 



AAAATGACAGAAGTGGCCCTGTCA Majority 



610 620 630 640



ICTGTTTCCTCTCC AG g| a AAATGACAGAAG TG GCCCTGTCA gphkng!815- 

AAAATGACAGAAGTGGCCCTGTCA gp7b-l. 

gp7c-l. 



gp7d-l. 



GCAAAGGGGTCACTGAGGAAGATGCGCAGGTGTCACACAT Majority

650 660 670 680

641 GCAAAGGGGTCACTGAGGAAGATGCGCAGGTGTCACACAT gphkng1815-1
593 GCAAAGGGGTCACTGAGGAAGATGCGCAGGTGTCACACAT gp7b-1
569 gp7c-1
568 gp7d-1



AGAGCATGTGTTCAGCCAGCTGAGCGCAGATGTGACATCT Majority

690 700 710 720

681 AGAGCATGTGTTCAGCCAGCTGAGCGCAGATGTGACATCT gphkng1815-1
633 AGAGCATGTGTTCAGCCAGCTGAGCGCAGATGTGACATCT gp7b-1



gp7c-l . 
gp7d-l . 



CTCTTCAACAGAAGCCTTTACGTCTTCAAACAGCTGCGGC Majority

730 740 750 760

721 CTCTTCAACAGAAGCCTTTACGTCTTCAAACAGCTGCGGC gphkng1815-1
673 CTCTTCAACAGAAGCCTTTACGTCTTCAAACAGCTGCGGC gp7b-1
569 gp7c-1
568 gp7d-1



569 
568 



GAGAATTTGACCAGGCTTTTCAGTCATATTTCACATCGGG Majority

770 780 790 800

761 GAGAATTTGACCAGGCTTTTCAGTCATATTTCACATCGGG gphkng1815-1
713 GAGAATTTGACCAGGCTTTTCAGTCATATTTCACATCGGG gp7b-1
gp7c-1
gp7d-1



S69 
568 



GACTGACGTTACAGAGCCTTTCTTTTTTCCATCTTTGTCC Majority

810 820 830 840

801 GACTGACGTTACAGAGCCTTTCTTTTTTCCATCTTTGTCC gphkng1815-1
753 GACTGACGTTACAGAGCCTTTCTTTTTTCCATCTTTGTCC gp7b-1



gp7c-l. 
gp7d-l. 
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A A G G A G C C AGCCTACAGAQ CAGATGCTGAO CCAAOCTGGG Majority 

850 860 870 880

841 AAGGAGCCAGCCTACAGAGCAGATGCTGAGCCAAGCTGGG gphkng1815-1
793 AAGGAGCCAGCCTACAGAGCAGATGCTGAGCCAAGCTGGG gp7b-1
CCAGCCTACAGAGCAGATGCTGAGCCAAGCTGGG gp7c-1
CCAGCCTACAGAGCAGATGCTGAGCCAAGCTGGG gp7d-1



569 
568 



CCATTCCCAATGTCTTCCAGCTGCTCTGCAACTTGAGTTT Majority

890 900 910 920

881 CCATTCCCAATGTCTTCCAGCTGCTCTGCAACTTGAGTTT gphkng1815-1
833 CCATTCCCAATGTCTTCCAGCTGCTCTGCAACTTGAGTTT gp7b-1
605 CCATTCCCAATGTCTTCCAGCTGCTCTGCAACTTGAGTTT gp7c-1
602 CCATTCCCAATGTCTTCCAGCTGCTCTGCAACTTGAGTTT gp7d-1

CTCAGTTTATCAAAGTGTCAGTGAAAAACTCATCACAACC Majority

930 940 950 960

921 CTCAGTTTATCAAAGTGTCAGTGAAAAACTCATCACAACC gphkng1815-1
873 CTCAGTTTATCAAAGTGTGAGTGAAAAACTCATCACAACC gp7b-1
645 CTCAGTTTATCAAAGTGTCAGTGAAAAACTCATCACAACC gp7c-1
642 CTCAGTTTATCAAAGTGTCAGTGAAAAACTCATCACAACC gp7d-1

CTGCGTGCCACAGAGGACCCTCCAAAACAAGACAAAGACT Majority

970 980 990 1000

961 CTGCGTGCCACAGAGGACCCTCCAAAACAAGACAAAGACT gphkng1815-1
913 CTGCGTGCCACAGAGGACCCTCCAAAACAAGACAAAGACT gp7b-1
685 CTGCGTGCCACAGAGGACCCTCCAAAACAAGACAAAGACT gp7c-1
682 CTGCGTGCCACAGAGGACCCTCCAAAACAAGACAAAGACT gp7d-1

CCAACCAGGGAGGCCCGATTTCAAAGATACTACCTGAGCA Majority

1010 1020 1030 1040

1001 CCAACCAGGGAGGCCCGATTTCAAAGATACTACCTGAGCA gphkng1815-1
953 CCAACCAGGGAGGCCCGATTTCAAAGATACTACCTGAGCA gp7b-1
725 CCAACCAGGGAGGCCCGATTTCAAAGATACTACCTGAGCA gp7c-1
722 CCAACCAGGGAGGCCCGATTTCAAAGATACTACCTGAGCA gp7d-1

AGACAGAGGCTCAGATGGGAAACTTGGCCAGAATTTGTCT Majority

1050 1060 1070 1080

1041 AGACAGAGGCTCAGATGGGAAACTTGGCCAGAATTTGTCT gphkng1815-1
993 AGACAGAGGCTCAGATGGGAAACTTGGCCAGAATTTGTCT gp7b-1
765 AGACAGAGGCTCAGATGGGAAACTTGGCCAGAATTTGTCT gp7c-1
762 AGACAGAGGCTCAGATGGGAAACTTGGCCAGAATTTGTCT gp7d-1

GATTGCGTTAATTTTCGCAAGAGATGCCAGAAATGCCAGG Majority

1090 1100 1110 1120

1081 GATTGCGTTAATTTTCGCAAGAGATGCCAGAAATGCCAGG gphkng1815-1
1033 GATTGCGTTAATTTTCGCAAGAGATGCCAGAAATGCCAGG gp7b-1
805 GATTGCGTTAATTTTCGCAAGAGATGCCAGAAATGCCAGG gp7c-1
802 GATTGCGTTAATTTTCGCAAGAGATGCCAGAAATGCCAGG gp7d-1
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ATTATCTATCTGATGACTGCCCT A A T G T G C CTGAACTATA Majority 

1130 1140 1150 1160

1121 ATTATCTATCTGATGACTGCCCTAATGTGCCTGAACTATA gphkng1815-1
1073 ATTATCTATCTGATGACTGCCCTAATGTGCCTGAACTATA gp7b-1
845 ATTATCTATCTGATGACTGCCCTAATGTGCCTGAACTATA gp7c-1
842 ATTATCTATCTGATGACTGCCCTAATGTGCCTGAACTATA gp7d-1

CAGAGAACTCAATGAGGCCCTCCGACTGGTCAGTAGATCC Majority

1170 1180 1190 1200

1161 CAGAGAACTCAATGAGGCCCTCCGACTGGTCAGTAGATCC gphkng1815-1
1113 CAGAGAACTCAATGAGGCCCTCCGACTGGTCAGTAGATCC gp7b-1
885 CAGAGAACTCAATGAGGCCCTCCGACTGGTCAGTAGATCC gp7c-1
882 CAGAGAACTCAATGAGGCCCTCCGACTGGTCAGTAGATCC gp7d-1

AATCAGCAATACGACCAGGTGGTGCAGATGACCCAGTATC Majority

1210 1220 1230 1240

1201 AATCAGCAATACGACCAGGTGGTGCAGATGACCCAGTATC gphkng1815-1
1153 AATCAGCAATACGACCAGGTGGTGCAGATGACCCAGTATC gp7b-1
925 AATCAGCAATACGACCAGGTGGTGCAGATGACCCAGTATC gp7c-1
922 AATCAGCAATACGACCAGGTGGTGCAGATGACCCAGTATC gp7d-1

ACCTGGAAGACACCACGCTTCTGATGGAGAAGATGAGAGA Majority

1250 1260 1270 1280

1241 ACCTGGAAGACACCACGCTTCTGATGGAGAAGATGAGAGA gphkng1815-1
1193 ACCTGGAAGACACCACGCTTCTGATGGAGAAGATGAGAGA gp7b-1
965 ACCTGGAAGACACCACGCTTCTGATGGAGAAGATGAGAGA gp7c-1
962 ACCTGGAAGACACCACGCTTCTGATGGAGAAGATGAGAGA gp7d-1

GCAGTTTGGCTGGGTTTCTGAACTGGCATACCAGTCCCCA Majority

1290 1300 1310 1320

1281 GCAGTTTGGCTGGGTTTCTGAACTGGCATACCAGTCCCCA gphkng1815-1
1233 GCAGTTTGGCTGGGTTTCTGAACTGGCATACCAGTCCCCA gp7b-1
1005 GCAGTTTGGCTGGGTTTCTGAACTGGCATACCAGTCCCCA gp7c-1
1002 GCAGTTTGGCTGGGTTTCTGAACTGGCATACCAGTCCCCA gp7d-1

GGAGCTGAGGACATCTTTAATCCAGTGAAAGTAATGGTAG Majority

1330 1340 1350 1360

1321 GGAGCTGAGGACATCTTTAATCCAGTGAAAGTAATGGTAG gphkng1815-1
1273 GGAGCTGAGGACATCTTTAATCCAGTGAAAGTAATGGTAG gp7b-1
1045 GGAGCTGAGGACATCTTTAATCCAGTGAAAGTAATGGTAG gp7c-1
1042 GGAGCTGAGGACATCTTTAATCCAGTGAAAGTAATGGTAG gp7d-1

CCCTAAGTGCTCATGAAGGAAATTCTTCTGATCAAGATGA Majority

1370 1380 1390 1400

1361 CCCTAAGTGCTCATGAAGGAAATTCTTCTGATCAAGATGA gphkng1815-1
1313 CCCTAAGTGCTCATGAAGGAAATTCTTCTGATCAAGATGA gp7b-1
1085 CCCTAAGTGCTCATGAAGGAAATTCTTCTGATCAAGATGA gp7c-1
1082 CCCTAAGTGCTCATGAAGGAAATTCTTCTGATCAAGATGA gp7d-1
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C ftCAQTQGTTCCTTCAAOCCTCGTOCCTTCCTCTAACTTC Majority 
1410 1420 1430 1440

1401 CACAGTGGTTCCTTCAAGCCTCCTGCCTTCCTCTAACTTC gphkng1815-1
1353 CACAGTGGTTCCTTCAAGCCTCCTGCCTTCCTCTAACTTC gp7b-1
1125 CACAGTGGTTCCTTCAAGCCTCCTGCCTTCCTCTAACTTC gp7c-1
1122 CACAGTGGTTCCTTCAAGCCTCCTGCCTTCCTCTAACTTC gp7d-1

ACACTCAGCAGCCCTCTTGAAAAGAGTGCTGGCAACGCTA Majority

1450 1460 1470 1480

1441 ACACTCAGCAGCCCTCTTGAAAAGAGTGCTGGCAACGCTA gphkng1815-1
1393 ACACTCAGCAGCCCTCTTGAAAAGAGTGCTGGCAACGCTA gp7b-1
1165 ACACTCAGCAGCCCTCTTGAAAAGAGTGCTGGCAACGCTA gp7c-1
1162 ACACTCAGCAGCCCTCTTGAAAAGAGTGCTGGCAACGCTA gp7d-1

ACTTCATTGATCACGTGGTAGAGAAGGTTCTTCAGCACTT Majority

1490 1500 1510 1520

1481 ACTTCATTGATCACGTGGTAGAGAAGGTTCTTCAGCACTT gphkng1815-1
1433 ACTTCATTGATCACGTGGTAGAGAAGGTTCTTCAGCACTT gp7b-1
1205 ACTTCATTGATCACGTGGTAGAGAAGGTTCTTCAGCACTT gp7c-1
1202 ACTTCATTGATCACGTGGTAGAGAAGGTTCTTCAGCACTT gp7d-1

TAAGGAGCACTTTAAAACTTGGTAAGAAGATTTAGTCCAT Majority

1530 1540 1550 1560

1521 TAAGGAGCACTTTAAAACTTGGTAAGAAGATTTAGTCCAT gphkng1815-1
1473 TAAGGAGCACTTTAAAACTTGGTAAGAAGATTTAGTCCAT gp7b-1
1245 TAAGGAGCACTTTAAAACTTGGTAAGAAGATTTAGTCCAT gp7c-1
1242 TAAGGAGCACTTTAAAACTTGGTAAGAAGATTTAGTCCAT gp7d-1

CCTATAATCAGCAAGAATTACACCTTCGGCCAAGACCTGA Majority

1570 1580 1590 1600

1561 CCTATAATCAGCAAGAATTACACCTTCGGCCAAGACCTGA gphkng1815-1
1513 CCTATAATCAGCAAGAATTACACCTTCGGCCAAGACCTGA gp7b-1
1285 CCTATAATCAGCAAGAATTACACCTTCGGCCAAGACCTGA gp7c-1
1282 CCTATAATCAGCAAGAATTACACCTTCGGCCAAGACCTGA gp7d-1

GAATTCTGAAAATACAAAGCAGGCTAACACAATGAACACA Majority

1610 1620 1630 1640

1601 GAATTCTGAAAATACAAAGCAGGCTAACACAATGAACACA gphkng1815-1
1553 GAATTCTGAAAATACAAAGCAGGCTAACACAATGAACACA gp7b-1
1325 GAATTCTGAAAATACAAAGCAGGCTAACACAATGAACACA gp7c-1
1322 GAATTCTGAAAATACAAAGCAGGCTAACACAATGAACACA gp7d-1

GCTGCATGAAAGTTAGGTATATATTAGGAAGCACTATTGG Majority

1650 1660 1670 1680

1641 GCTGCATGAAAGTTAGGTATATATTAGGAAGCACTATTGG gphkng1815-1
1593 GCTGCATGAAAGTTAGGTATATATTAGGAAGCACTATTGG gp7b-1
1365 GCTGCATGAAAGTTAGGTATATATTAGGAAGCACTATTGG gp7c-1
1362 GCTGCATGAAAGTTAGGTATATATTAGGAAGCACTATTGG gp7d-1
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TTTACTTTGTTGAATGGAAGTTTAATAGCTATTCA A A T T G Majority 

1690 1700 1710 1720

1681 TTTACTTTGTTGAATGGAAGTTTAATAGCTATTCAAATTG gphkng1815-1
1633 TTTACTTTGTTGAATGGAAGTTTAATAGCTATTCAAATTG gp7b-1
1405 TTTACTTTGTTGAATCGAAGTTTAATAGCTATTCAAATTG gp7c-1
1402 TTTACTTTGTTGAATGGAAGTTTAATAGCTATTCAAATTG gp7d-1

AGTTAATATAAAAATTTCTTCCTAAAAAGTAAAATGTACA Majority

1730 1740 1750 1760

1721 AGTTAATATAAAAATTTCTTCCTAAAAAGTAAAATGTACA gphkng1815-1
1673 AGTTAATATAAAAATTTCTTCCTAAAAAGTAAAATGTACA gp7b-1
1445 AGTTAATATAAAAATTTCTTCCTAAAAAGTAAAATGTACA gp7c-1
1442 AGTTAATATAAAAATTTCTTCCTAAAAAGTAAAATGTACA gp7d-1

TATGTAGAATATGATGCATTAGTTCTTTGTATACTAAATA Majority

1770 1780 1790 1800

1761 TATGTAGAATATGATGCATTAGTTCTTTGTATACTAAATA gphkng1815-1
1713 TATGTAGAATATGATGCATTAGTTCTTTGTATACTAAATA gp7b-1
1485 TATGTAGAATATGATGCATTAGTTCTTTGTATACTAAATA gp7c-1
1482 TATGTAGAATATGATGCATTAGTTCTTTGTATACTAAATA gp7d-1

AATACTGAGTCCCCT Majority

1810

1801 AATACTGAGTCCCCT gphkng1815-1
1753 AATACTGAGTCCCCT gp7b-1
1525 AATACTGAGTCCCCT gp7c-1
1522 AATACTGAGTCCCCT gp7d-1
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bhkngl GCAACCrCGTTGGTGAGAGCCTGCAGTTTlGTGTCACGGCGGAAACATGAAGCCGC 

bhkng2 CAGAAGCTGGTGX3CAACCTCX3TTGGTGAGAGCCTGCAGT7A 

bhkng3 gtgaaggtccttagagaagctggtggcaacctcg'^^ 

81 160 

bhkng2 CACTCTTGGTGTTTATTGTGTATCTGCTGC 
bhkng3 CAATCTTGGTGTTTATCGTGTATCTGCTGCAGCT^ 

161 240
bhkngl CGTGAAGACCCQAAGGGTTTTTCCAAGGCTG<X^AGATAGACGTAGATGAA^ 
bhkng2 CGTGAAGACCCGAAGGGTTTTTCGAAGGCTG<K3^ 

bhkng3 (XH'GAAGACCCXSAAGXKSTTTTTCCAAG^ CATG AA 

241 320

bhkngl GCAGAIXIAAAATCXTTGATGCIAAAGAAGA^ 
bhkng2 GCAGATOAAAATCaCGATGC^^ 
bhkng3 GCAGATGAAAATCCTGATGGAAAGAAGAGA 

321 400 
bhkngl AGCAGGAGGCCCTGAAGCTTATGAA^ 
bhkng2 AGCAGGtAGGCCCTGAAGCITATCAATGAAG^ 
bhkng3 AGCAGGAG&CCCTGAAGCTTATGAATC 

401 480 
bhkngl GG1TCCTGGGACGAATGCAAATCTTO 
bhkng2 GOTTCCTG^JACXlAATGCAAATCr^ 
bhkng3 Gk3TTCCrGGGACX3AATG(yU\ATCTTC 

481 560 
bhkngl TATGAAATCCAO^TTGAACGGGTTTTCCG<3AAGATA^ 
bhkng2 TATGAAATCCACX&TTGAACGX^ 
bhkng3 TATGAAATCCACGATTGAACXXX3TTTTCC 

561 640
bhkngl TTCCTGTTGGTGAOAAGOTCACrGAGGAAGArrGTAC^ 
bhkng2 TTCCTGTTGGTGMAAGOTCACTGA^ 
bhkng3 TTCCTGTTGGTGAGiAAGTTCACTGAGGAAGATGTA^ 

641 720
bhkngl . GTGGCakTTTCTCTATAAX2ATGM 
bhkng2 GTCJGGATTTCTCTATAACATG^ 
bhkng3 GTGGGATTTCTCTATAACATX1AGCT 

721 800
bhkngl TATGTCAOACACAGACTCCATGGAG^^^ 

bhkng2 TAlxmsAOACACAOACrc^^ 
bhkng3 TAtGTCAGACACAGUU^rcc^^ 

801 880
bhkngl AGAgnKSGCftC^^^ 
bhkng2 AGAGrrGQGACATTCCXA ^ 
bhkng3 AGAGTTGK-K3AJCATTCC 
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881 960 
bhkngl AC^AGATGCTGAAGGCC^TTGAGGACTTATCCAAACAAGAC^ 
bhkng2 ACAGAGATGCTGAAGGCCATTGAGGACTTATCCAAAC^^ 

bhkng3 ACAGAGATGCTTGAAGGCCATTGAGGACTTATCCAAACAAGACAAAGATTCTGCCCA

961 1040

bhkngl GCCTGTGCGGGGCAGAGGGCTGTGTGGAGAACCTGGCC^ 
bhkng2 GCCTGTG03GGGCAGAGG<3CTGTGTGGAGAACCTGGCCAGAACrCGTCCGA^ 
bhkng3 GCCTGTGCX!K3G<3CAGAGG<3CTC 

1041 1120 
bhkngl AATGTCAGGATTACCTATGGG GAG ACTG CCCTGCIXSTTCCTGAACTATACACAAAGG CGGATGAGGCCCTTG AGTTGGTC 
bhkng2 AATGTCAGGATTACCTATG^CAGACTGCCCTGCTGTTCCT^ 
bhkng3 AATGTCAG0tflTTA{!CTATGG<K3U^ 

1121 1200
bhkngl AACATATCCAATGfcGCAGTATCCCCM 
bhkng2 AACATATCCAATGAGCAGTATGCCCAGGT^ 
bhkng3 AACATATCCAATCAGCAGTATGCCC^GGTACTCCAGATGACC 



1201 1280
bhkng1 GATGAGAGLAGCAGTirTTGGTTXiGGTAACAGAG CTGGCCAGCCAGACCCCAGG AAG CGAG AACATCTTCAGTTTCATAAAGG
bhkng2 GATGAGAGAGCAGTTTGGTTGGGttAACAG^
bhkng3 GATGAGAGAGCAGfiTTT0K3TTGK^

1281 1360
bhkng1
bhkng2
bhkng3

1361 1440
bhkng1 TTCAC^CTCACCATCCCTCrrTGAAGAAAGTGC^
bhkng2 TTCACACTCACCATCCCTCrTGAAGAAAGTC
bhkng3 TT^CACrCACCATCTCTCTTGAAGAAAGTGCTG^



1441 1520 
bhkngl TTTTAAGGAACATTTTAAATC1TGGTA 
bhkng2 TTTTAAGGAAOITTTTAAATCTTGOT 
bhkng3 TTTTAAC^3^\CATTTTAAATCXTGGTAAGCAG^ 

1521 1600 
bhkngl GGGJVAAAATGIACAAAACTAGCTTTTGAATACC^^ 
bhkng2 GGGAAAAATGACAAAACTAGCTTTTaAAT^ 
bhkng3 GG&AAAAATGACAAAACTAG47TTTTGA 

1601 1680 
bhkngl ACAAGTTAGCAGTTTTTACCTATTO^ 
bhkng2 ACAAGTTAGCAGTTTTTACXrrATT^ 

bhkng3 ACAAGTTAGCAGTTTTTACCT^ . TCCTGATAGAATGCAATOAAATOAOAATTCT^ 

1681 1760 
bhkngl ATATGTGATTGCCAGAAACAAAiCTGGTTra 
bhkng2 ATATGTGATTGCCSIGAAACAAACTGGT^ 
bhkng3 ATATGTGATTGCCAGAAACAAACIGGTTTTG^ 
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17€1 M« 

bhkngl TTTTTAATATAATAATTCX^CTTC^^ 
bhkng2 TTTTTA ATATAaTAATTCCACTTCTO^ 
bhkng3 TTTTTAATATAATAATTCXIACTT 

1841 1920
bhkngl CAATATTTGCaUUWTTATTTAAAA^ 
bhkng2 CAAT ATTTG CAAATmTTTAAAATAGTAAACTGOTA 
bhkng3 CAATATTTGCAAATTATTXAAAATACT 



1921 2000



bhkngl AAGCTGTCWCTAAATATGCTCCM^^^ 
bhkng2 AAGCTGTGGCTAAATATCXrTCCAAATA^ 
bhkng3 AAGCK5TGGCTAAATATGCTCQUUITATCTTATA 

2001 2080
bhkngl GCTGGAACAACaV GTTT ATTCTA^^ 

bhkng2 GCTGGAACAAQAGTTTATTCTAAGCCTATCTATAAGGCAAATATC 
bhkng3 GCTGGAACAACWVGTTTATTCTAAGCCTATCTATA^ 



2081 2160



bhkngl OXGGGTCCAAGTGfTTAGTTCKrrCAGTCATGTCTGA 
bhkng2 CAGGGnTCCAAGTGTO 

bhkng3 CAGCK3TCX!AA0TOTTAOTTGCTCAOTCATG^

2161 2240 
bhkngl CrTGGGATTCTTCyUlA.CAGGAATACTGG 
bhkng2 ATGGGATTCTl'a\GACAAGU\AT^ 
bhkng3 ATGGGAi. auaiC^GACAAGAATACTG^ 

2241 2320 
bhkngl CRGGTCTCCTGCATTGCAGGTAGATO 
bhkng2 CAGGTCTCCTGCATTGCAGOTAGATGCTT^ 
bhkng3 CAGGTCTC<nK3CI&TTGCAGCT 

2321 2400 

bhkngl AATTTTAACTTACn^TTTCTCTGA 
bhkng2 AATTTTAACTTAGTrXTCT^ 

bhkng3 aattttaacttaqttttctcxga^ 

2401 2480 
bhkngl GTGAAAAATC1AGTATAAAACTCTATAAAT 
bhkng2 GTGAAAAATGAGTATAAAACTCTATAAATGTA^ 
bhkng3 GTGAAAAATGAGTATAAAACTCTATAAATGTAATGATCAAAA 

2481 2560 

bhkngl GTTGGCAGG «. „ . 

bhkng2 <nTGGCAG<3AATTACX5GTTGGAAATGCJATGATTT^ 

bhkng3 GTTGGCAGG 

2561 2640 

bhkngl 

bhkng2 ATAAATAATTTTOM^TTTCAAAT^^ 

bhkng3 . 

2641 2720 

bhkngl ♦ * 

bhkng2 TOTTCTCATCXTTAAACTTTAaTA^ 

bhkng3 . . „ . 

2721 2800 

bhkngl * ;* 

bhkng2 TcrrACAGGAAAAACAACTAAATGACATC 

bhkng3 ♦ , 

2801 2880 

bhkngl 

bhkng2 ATTAGAACTGTACMAGTATTCTC 

bhkng3 . * , „ 

2881 2960 

bhkngl * * 

bhkng2 TCATAlWSiSACCTTGTAGGTA^ 

bhkng3 

2961 3029 

bhkngl , 

bhkng2 ctctgaatottggagaggttgttggtcatcaa 
bhkng3 
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-MKPPLLWIWXLRLRDCQCAPTaKDRTSIREDPKGFSKAaEIDVDEBVKK 



gphkn 3 181S_aa_ „ MKI^LIMPPVCIXWIja5CHCAPTWKDKTAISEWANSFSEAGEIDVIX3^"l 

SphkngiB 15_aa_ ^IGIKQMKIMHERREEEHSKLMKTLKKCKEEKQEMjKLHNBVHEHLEBEBSIX^VSIJU3 
h«Wcag_aa QP£WSSVKNKIERFFTlKIYQFI4'PFHEDNEmi^ISEKLIEEDAQk 
»htegl815_aa_ QPAWSSVKKM^^ 

l^^f-" TFQSHPISDTDLTEPYFFPArSKEPMTKRDLEtKTroiPNFFQI,FC^FSVSrYESVSCT 

bl*ngi_aa ^^k*^^pypfpafsio»akxahph<is^ 

"^" 91 -"* PSSTTHPVRaRGr^EPGONS5ECI^FHARCQKCQDYLWADCPAVTEI.YTKADKA^ 

h»hJci>g_a a AYLVEKMRGQFGWVSELANQAPETEI IFNS IQWPRI . .HKGNISKQDBTMMTDLSILPSSNFTLKIPLEKSAESSNPrG 

S pWcn g n8 1 5_aa_ TLMEKMREQFGWVSFJCAYQSPQAEDIFHPVKVMVAIjSAHEGNSSDQDD .IVVPSSL 

hnOikng_aa YWAKAI-QHFKBHFKTW 497 

bhkngl_a» YHIAXAVQHFKEHFKSW 
gphkagl81S_aa_ HWEXVLOHFKEHFKTW 
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matureHKNG - APTWKDKTAIS 

HKNGX-VX-IPF3 MRTWDYSNSGNMKPPLLWIVCLLWLWDSHCAPTWK33KTAIS 

HKNGX/ X-Vl -IPF2 MKPPLLVFIVCLLWLKDSHCAPTWKDKTAXS 

HKNGX-IPFX MKIKAEKNEGPSRSWWQLHWGDIAIWSGNKKPPLL^ 

*********** 

matureHKNG ENLKSFSEVGEIDADEEVKKALTGIKQMKIMMERKEKEHTNLM^^ 

HKNGX -VX-IPF3 ENLKS F S EVGE I DADE EVKKALTG I KQMK I MMERKEKEHTNLMS TLKKC REEKQ EALKI*L 

HKNGX / 1 -VI - 1 PF2 ENLKSFSEVGEIDADEEWKKAIjTGIKQMKIMMERKEKEH^ 

HKNGl-IPFl ENLKS F SEVGEI DADEEVKKAI/TG I KQMK IMMERKEKEHTNLMS*TLKKCREEKQEALKL.If 



matureHKNG NEVQEHLEEEERLCRESLADSWGECRSCLENNCMRIYTTCQPSWSSVKNKIER^ 
HKNGX-VX-IPF3 NEVQEHLEEEERKITCSIADSWGECRSC^ 
HKNGX/1-VX-IPF2 NEVQEHLEEEERLCRES&WSVIGECRSCLENNCMR^ 
HKNGX-IPFX NEVQ£HLEEEERWRESIJtf>SWGECRSCL^^ 



matureHKNG FLFPFHEDNEKDLPISEKLIEEEAQMQMED^ 
HKNGI-VX-IPF3 FLFPFHEDNEKDLPISEKLIEEDAQLTQMEDVFSQL^ 
HKNGX / 1 -VI - 1 PF2 FLFPFHEDNEKDLPISEKLIEEDAQLTQMEDVFSQLTTO 
HKNGX-IPFX FLFPFHEDNEKmPXSEKI,XSECAQI/I^^ 

************************************************************ 



matureHKNG TFQ$HFISDTOI*TEPVFFPAFSKEPMTKADLEQCWD!IPIIFFQX#FCNFSVSIYESVSETIT 
HKNGX -VX-IPF3 TFQSHFISDTDt/rEPYFFPAFSKEPKTiaUJLEQ^^ 

HKNGX/I-VI-IPF2 TFQSHFISMDDTEPYFFPAFSKEPMTKABLEQCWDIPNFFQLFCNFSVSIYESVSE1?IT 
HKNGX - 1 PF1 TFQSHFISOTD^TEPyFFPAFSKEPMTKABLEQCVroiPMFFQr.FCNFSVSIYBSVSE'TXT 



matureHKNG KMLKAIEDLPKQDKAPDHGGLISKMLPGQDRGI^GELDQNLSRCFKFHEKCQKCQAKLSE 
HKwGl-Vl-IPF3 KMLKAIEDI*FKQDKAPDHGGI.ISKMLP^ 

hkngx/ x -vi~xpf2 kmucaiedi,pkqdkapdhggi.iskml^ 

HKNGX-IPFX KM£»KAXEDLPKQDKAPDHGGI»XSKMLPGQDRGX^GEXJD^^ 



matureHKNG IXTPDVPALHTEIjDEAIRLVNVSNQQYGQILQMTRICHX. 
HKNGI-V1-IPF3 IXTPEWAIJrrEI^EAIRLVNVSNQQYGQII^MTRKKLEOT 
HKNGX/ I~VI-IPF2 DCPOTPALHTEkDEAXJUiVNV^^ 
HKNGI-XFFX 1X:PXJVFAI#HTEI^EAXRLVWSNQQYGQII^^ 



matureHKNG APETEIIFNSIQVVPRIHEGNISKQDETMMTOLSILPSSN^^ 

HK&31-VX-XPF3 APETEIiraSIQVVPRIHEC^ISKQDETMMTDI*SILPSSNFTI^IPLEESAESSNFIGW 
K»?GX/X-VX-XPF2 APKTSXIFNSIQVVPRXKEGNXSKQDETMMTDLSILPS^ 

HKNGX-IPFX APETEIIF1ISIQVVPRIHEGNISKQDETM^^©IUSILPSSNFTIJCIPI*EESAESSNFIGW 
************************************************************ 

matureHKNG VAKALQHFKEHFKTW
HKNGX -VX-IPF3 VAKALQHFKEHFKTW 
HKNGX/X-VX-IPF2 VAKALQHFKEHFKTW 
HKNG1-IPF1 VAKALQHFKEHFKTW
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RH ^QARAAGLVST L E V A D T 
CGT 0X0 CAG GCC CGG GC < GCG GGG TTG GTT TCC ACC CTG GAG GTT GCT GAG ACC 

LC PRLTSSRWHRRi,Q QAArjK 
CTG TGC CCT CGG CTG ACT TCC AGC CGG TGG CAC AGA CGC CTC CAG GGG GCA GCA CTC AAG 

CGC ATC TTA GGA ATG ACA GAG TTG CGT CCC TCT CTG TTG CCA CGC TGG ACT tL GTG G^A 

CS#LTBASNs W VQvTLPP r,D 
TOT TCT TAG CTC ACT GAA GCC TCA AAT TCC TGG GTT CAA GTG ACC CTC CCA CCT CAG CCC 237 

CAT GAG GAC CTG GGA CTA CAG GAC aL GCT AAA TCC CTG ACA CGG ATG AAA ATT AAA GCA 297

GAG AAA AAC GAA GGT CCT TCC AGA AGC TGG TGG CAA CTT C^C TGG GGA GAT ATT GCA. AAT 357 

AAC AGC GGG AAC ATG AAG CCG CCA CTC TTG GTG TTT ATT GTG TGT CTG CTG TGG TTG AAA 417

DSHCA PTWKDKTAI 5KMi 
GAC AGT CAC TGC GCA CCC ACT TGG AAG GAC AAA ACT GCT ATC AGT GAA AAC CTG AAG AGT 477

*' s E V G E I D A D B E V K It » r t , 
TTT TCT GAG GTG GGG GAG ATA GAT GCA GAT GAA GAG GTG AAG AAG GCT TTG ACT GOT ATT 537 

AKJ CM ATS AAA ATC ATC ATO GAA AGA AAA GAG AAG GAA CAC ACC AAT CTA ATG AGC ACC 597

<^™aa\tgcagagLgLaLcLgLg^ ™ 

CAT CTG GAG GAA GAA GAA AGG CTA TSC CGG GAG TCT TTG GCA GAT TCC TGG GGT GAA TGC 717 

RSCLE NHCMr i v t Tcopsm _ .„.„ 

AGG TCT TGC CTG GAA AAT AAC TGC ATG AGA ATT TAT ACA ACC TGC CAA CCT AGC TGG TCC 777 

TCT GTG AAA AAT AAG CTC CTG ACC ACQ GAG GCC TGA TTT CAA AGA TGT TAC CTG GGC AGG 837 

T8DCVC,M l ' T *ICQDVSHPMir r.o a 
ACA GAS GAC TGT GTG GGG AAC TTG ACC AGA ATT TGT CAA GAT GTT TCA AAT TTC ATG AAA 897

MA KMVRZtTTX»KTV IjMY I.T r» m ■»-. 
AAT GCC AAA AAT CTC AGG CTC ACC TAT CTG AAG ACT GTC CTG ATG TAC CTG CTC TGC ACA 957 

QH*TRRSG WS MYPTSS M»B» 
CAG AAT TAG ACG AGG CGA TCA GGT TGG TCA ATG TAT CCA ATC AGC AGT ATG GCC AGA TTC 1017

S R* PGSTW-RT PPI WWRB .,._ 
TCC AGA TGA CCC GGA AGC ACT TGG AGG ACA CCG CCT ATC TGG TGG AGA AGA TGA GAG GGC 1077 

AAT TTG GCT GGG TGT CTG AAC TOG CAA ACC AGS CCC CAG AAA CAS AGA TCA TCT TTA ATT 1137

QY R*PQaPMKBIPPMKMKO. *<>. 
CAA TAC AGG TAG TTC CAA GGA TTC ATG AAG GAA ATA TTT CCA AAC AAG ATG AAA CAA TGA 1197

TGA CAG ACT TAA GCA TTC TGC CTT CCT CTA ATT TCA CAC TCA AGA TCC CTC TTG AAG AAA 1257

VL *VX.TS1.A. T. MQ KLY SIl4R 
OTG CTG AGA GTT CTA ACT TCA TTG GCT ACG TAG TGG CAA AAG CTC TAC AGC ATT TTA AGS 1317
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NXLKPGKKI + c 
AAC ATT TTA AAA CCT GGT AAG AAG ATC TAA TGC 

SSSGTWKS*NK 
TCT TCA TCT GGG ACC TOG AAA TCC TGA AAT AAA 

RKVC*LYTMKY 
AGG AAA GTA TGT TAG CTA TAT ACT ATG AAG TAC 

S Y*YSN*V KMK 
AGC TAT TAA TAC TCA AAT TGA GTT AAA ATG AAA 

MYYISWYISSS 
ATG TAT TAC ATT TCA TGG TAC ATT AGT AGT TCT 

h 
CTA 



ILYPVSRII 459 
ATC CTA TAT CCA GTA AGT AG A ATT ATC 1377 

KG*CNKHSC 479 
AAA GGA TAA TGC AAT AAA CAC AGT TGC 1437 

S * FTYVEWL 499 
TCT TAG TTT ACT TAT GTT GAA TGG CTT 1497 

I p P * K I K R N 519 
ATT CCT CCT TAA AAA ATC AAA CGT AAT 1557 

L Y I B *■ I LNH 539 
TTG TAT ATT GAA TAA ATA CTA AAT CAC 1617 

540 
1620 
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